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CONFESSION OF FAITH: 1955* 


Rap J. WATKINS 
Dun & Bradstreet, Inc., New York 


N THE 1930’s a French observer of the American scene took notice, 
i] in an article in one of our national magazines, of the penchant of 
Americans for statistics, the strong tendency to measure, to count, to 
give expression in numbers. This tendency he summed up in the amus- 
ing characterization that in statistics the American has discovered a 
substitute for ideas, enabling him to dispense with both beauty and 
wisdom. 

For this audience, at the 115th Annual Mecting of the American 
Statistical Association, perhaps the simplest response would be to ad- 
mit with pride the penchant for statistics, but with restrained if mock 
indignation to deny the amusing characterization as both libelous and 
naive in its lack of understanding of the nature and role of statistics in 
our society. The counter argument might proceed with a demonstration 
of some of the many elegant solutions to be found in our work and of 
the beauties inherent generally in statistical distributions and relation- 
ships, followed by the statistician’s appeal to reason, in the choice of 
statistics and techniques, as well as in the aualysis, interpretation, 
and synthesis of his statistical results—the recognition that cerebration 
and balanced judgment must command at all stages: hypothesis, 
experiment, conclusion, synthesis. In perhaps the instance of greatest 
frequency in our craft, we would point out that choice of statistics in 
correlation analysis must rest on an antecedent hypothesis as to causa- 
tion; and that causal explanations of demonstrated correlations must 
rest on causal theory. 

My purpose this evening, however, is not defense. Rather, it is to 
review with you some of the solid progress that is behind us and to 
outline some of the promising trends that point the way ahead. 





* Presidential Address at the 115th Annual Meeting of the American Statistical Association, New 
York, December 29, 1955. 
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In the first publication of the American Statistical Association, 
published in Boston in 1843, the Publishing Committee of the Associ- 
ation deemed it a matter of propriety in its Preface to “call the atten- 
tion of the public to a brief statement respecting the peculiar sphere 
and objects of their labors. For this purpose they avail themselves of 
the principal portions of an Address put forth by the Association at 
the period of its first establishment, in the year 1840; which was drawn 
up with much learning and ability, by Rev. Professor B. B. Edwards 
[D.D.], of the Andover Theological Seminary.” The quoted and en- 
dorsed address defined statistics to be “the ascertaining and the bring- 
ing together of those facts, which are fitted to illustrate the condition 
and prospects of society,” and stated that “the object of statistical 
science is to consider the results which those facts produce, with the 
view to determine the principles upon which the well-being of society 
depends.” The speaker properly noted the establishment of the Sta- 
tistical Society of London six years earlier, “in pursuance of a recom- 
mendatijon of the British Association for the Advancement of Science,” 
and quoted its prior definition of statistics, from which his definition 
differed only slightly, and cited the first number of the London So- 
ciety’s Journal. 

Now these definitions of the Association’s Founding Fathers reflect 
the “political arithmetic” concept of statistics of that period and reveal 
no awareness of methodology or of the coming impact on statistics of 
the developments in the theory of probability. All of this is scarcely to 
be wondered at. Quetelet, the Belgian mathematician and astronomer 
turned statistician, who first fused “political arithmetic” and the calcu- 
lus of probability, had got his training ia probability under La Place 
in Paris not many years earlier. These definitions of our Founding 
Fathers seem far removed also from the concept of “the measurement 
of the uncertainty of inductive conclusions,” the unique feature of 
statistics discussed in Professor Snedecor’s Presidential Address before 
this forum seven years ago; and far removed from the modern technical 
concepts represented in the titles to many of the papers at this present 
meeting of the Association. 

There are, nevertheless, several aspects of these definitions of 115 
years ago, which merit some notice as well as approbation. First, I 
would point out that our Founders were,cuncerned not merely with the 
measurement of the existing “condition” of society but gave recogni- 
tion also to measurement of “prospects,” or irends or projections, as 
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we might put it. Any later tendencies on the part of statisticians to- 
ward concentration on a backward-looking process cannot fairly be 
charged to the Founding Fathers. 

Second, I would note that our Founders were not concerned with 
statistics for statistics’ sake but rather had a positive, purposeful, and 
scientific object in view, namely, “to determine the principles upon 
which the well-being of society depends.” 

Our Founders’ dedication to purposeful statistics found reflection 
again in the Association’s second publication, issued in 1845, on the 
population of Massachusetts for the years from the establishment of 
the colony up to the beginnings of the Federal Census. In a prefatory 
note the deficiencies of the data were duly noted—setting a pattern 
from which there has been scant deviation by members of our craft 
since that time—but the need for the information “in this department 
of . . . political economy” was stressed ; and in the closing words it was 
said of the publication that “public usefulness is its primary object.” 

Third, I would call attention to the early linkage of statistics to 
political economy. On the first page of this second publication, it was 
noted thaé “the earliest history of nations shows, that they have ever 
regarded their inhabitants as of the first importance in the science of 
political economy.” And after a paragraph of discourse on population, 
emphasizing the young nation’s abundant resources “ . . . not only for 
the natural increase of our own people, but, also, for the multitudes, 
who annually throng to our shores,” the paragraph closed with this 
observation in the field of political economy: “It will be a pleasant sign 
in the knowledge of our Republic, when writers, like Smith and Mal- 
thus, shall arise and be encouraged to publish their researches, specula- 
tions and conclusions.” Here, then, we see in the beginnings and funda- 
ments of our craft, the close linkage of statistics with economics—-of 
the “political arithmetic” of the day with the political economy of the 
day. Presently I shall have more to say on that continued linkage to this 
day. 

Despite this close linkage with political economy, it should be made 
clear that our Founders recognized the all-embracing scope of statis- 
tics. In the quoted and applauded words of Dr. Edwards in 1840, “sta- 
tistics are closely allied to the other sciences, and receive contributions 
from all of them.” The 18 sciences and fields enumerated in that address 
included geography, geology, agriculture, zoology, industry, commerce, 
botany, horticulture, chemistry, medicine, mechanics, meteorology, 
physiology, population, education, finance, government, and religion. 
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Our Founding Fathers staked out, I believe we can say, a sufficiently 
broad scope in terms of subject matter fields. Perhaps we may add, 

‘with tongue in cheek, that we still have, 115 years later, ample lee-way 
for the formation of sections! 

There is one further aspect of the Association’s beginnings I wish to 

call attention to, and that is the scientific detachment with which the 
Founders approached the problem of recording the American experi- 
ment with free political institutions. Follow these words from the 
Preface to the Association’s first publication: “Another advantage 
which may be expected to flow from statistical inquiries ia this country 
has relation to our peculiar civil institutions. It is of +he utmost impor- 
tance that, while the experiment of free political institutions is in prog- 
ress, all the facts pertaining to this experiment should be carefully 
~ gathered up and recorded. The whole process of the experiment should 
be accurately marked and faithfully described by competent observers, 
at the time, and on the spot. In this way if we succeed, it will be known 
why we succeed; if we fail, the causes of this failure will be apparent. 
Thus we shall have faithful observations and unquestionable facts on 
which to rely, instead of being left to guess at the reasons, and substi- 
tute speculation in the place of certain knowledge. How inestimably 
precious would be a thorough statistical view of the ancient Grecian 
Republics, a view which should enter into all the details. ... The 
ancient historians leave us in complete darkness in respect to a thou- 
sand interesting questions. ... Our course sheuld be more intelligent 
and benevolent. If we value our institutions, we should be able to 
show good grounds for our favorable estimate, not only in our actual 
prosperity, but by our recorded knowledge, by facts well ascertained, 
and philosophically classified. And this duty cannot be safely left to 
incividuals. It is only by means of associated labor, that the proper 
results can be secured.” 

There is here, it seems to me, not only a clear reflection of the scien- 
tific spirit but a sense of destiny in the ring of those words, a sense of 
destiny reminiscent of the pen of Alexander Hamilton in 1787 in the 
Federalist, No. 1, writing of “... the fate of an empire in many re- 
spects the most interesting in the world,” and observing that, “it has 
been frequently remarked that it seems to have been reserved to the 
people of this country, by their conduct and exemple, to decide the 
important question, whether societies of men are really capabie or not 
of establishing good government from reflection or choice, or whether 
they are forever destined to depend for their political constitutions on 
accident and force.” 











CONFESSION OF FAITH: 1955 


ire II 

Before this forum in 1953, in his Presidential Address on the present 
structure of the Association Professor Cochran pointed out the great 
broadcning of the Association’s activities into many subject-matter 
fields, in contrast to the earlier history of concentration of the Associ- 
ation in the fields of business and economic statistics. That broadening 
of scope into many fields is part and parcel of the development of the 
science of statistics; and has indeed been accompanied by increased 
venetration in depth in each of the several fields. Likewise, this broad- 
ening process has brought with it tremendous gains, actual and poten- 
tial, in the form of cross-fertilization of fields, the transfer of statistical 
techniques from one subject matter field to another, the exchange of 
data and design; and in general has opened up great possibilities in 
inter-disciplinary research. 

In a sense, this broadening is part of the process of growing up, of 
maturing. Great advances have been made in statistical techniques 
and methodology; and as Professor Wilks noted in his Presidential Ad- 
dress in 1950, significant progress has been made in graduate and re- 
search training in both applied and theoretical statistics in numerous 
subject-matter fields. Stimulating and paralleling in a measure these 
advances and lines of progress, the Association has come of age and 
has extended iis scope and activities accordingly. This is a development 
that has immensely enriched each field embraced in the Association’s 
activities, and it is a development of which we can all feel proud. 
Whatever loss has accrued in the field of business and economic sta- 
tistics, from the cessation of concentration of Association activities in 
that field, has been more than offset by the gains in depth and maturity 
in the field and by the gains from cross-fertilization. 

My own interest and membership in the American Statistical Associ- 
ation extend back about three decades, to the middle 1920’s; and the 
strongest impression I have from those years of interest and member- 
ship is one of tremendous progress in the field of statisties, both applied 
and theoretical, and in the Association’s work, both in scope and in 
depth. That progress is part of the heritage we celebrate at this 115th 
Annual Meeting; and I believe the Association may rightly claim a 
significant share of the credit for that progress. I am confident that the 
membership of the Association generally joins me in the fervent hope 
that the Association will continue over the years as the central scien- 
tific and professional society concerned with both applied and theoreti- 
cal statistics; that it will be ever sensitive to the needs of the new or 
emerging interest groupings, as well as to the continuing needs of the 
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established groupings; and that generous-spirited and hospitable 
moves will always be made to welcome back to the central fold, by 
association, closer association, or otherwise, interest groupings that 
have drifted away. There is room in our central fold for all such group- 
ings in the fields of applied and theoretical statistics; and it is in the 
mutual interest of all statisticians, and indeed in the interest of society, 
that we have a strong central association whose objects are, to quote 
from our Constitution, “ .. . to foster in the broadest manner statis- 
tics and its applications, to promote unity and eifectiveness of effort 
among all concerned with statistical problems, and to increase the 
contribution of statistics to human welfare.” 


III 


Let us return to the subject of the close linkage of statistics with 
economics, or “political arithmetic” with political economy, if you 
wish. As stated earlier in these remarks, that historical linkage persists 
to this day, and will I believe continue, despite the broadening of scope 
in statistics and despite the broadening of the work of this Association, 
a broadening we all applaud for the great gains it has brought to sta- 
tistical |science, to science generally, to American society, and to the 
Association. 

It is, I believe, a development of historic importance in the United 
States that the Congressional Joint Committee on the Economic Re- 
port has with apparent earnestness assumed a leadership role in the 
field of business and economic statistics. Although the Joint Com- 
mittee’s interest is naturally focused primarily on the statistical pro- 
grams of the Federal Government, it is by no means restricted to such 
programs, Rather, the Committee’s work makes clear that it intends, 
very properly in my judgment, to concern itself with the entire field 
of business and economic statistics, insofar as such programs, public 
or private, are helpful in furthering an understanding of the functioning 
of the economy and in facilitating the aims of public policy embraced 
explicitly or implicitly in %".: ‘imployment Act of 1946, under which 
Act the Joint Committee was established almost ten years ago. The 
logic of that interest is crystal clear, since without adequate statistical 
measures the Economic Report could neither be formulated by the 
President and his Advisers nor reviewed and appraised by the Joint 
Committee. 

For the first time in the history of the country, there is a national 
public forum before which the adequacy of programs in business and 
economic statistics can be reviewed and appraised; and through which 
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sound programs can be furthered. The pioneering work done in 1954 
by the Joint Committee’s Subcommittee on Economic Statistics rep- 
resented a salient contribution. The Subcommittee’s hearings and 
reports have helped bring about, I believe, a more intelligent under- 
standing, in the nation generally as well as within the Congress and 
the Executive Branch of the Government, of the place that statistical 
programs occupy in the management of the economy and why we must 
have these statistical measures. 

Again, the Joint Committee, through its ubecommittee on Economic 
Statistics, has made history in 1954 ar.” 1955 through the systematic 
reports on the status and adequacy of bo... public and private economic 
statistics in certain major areas, which reports by designated Com- 
mittees of statisticians and economists have been submitted to the 
Joint Committee under the aegis of the Board of Governors of the 
Federal Reserve System. These reports are, I believe, certain to exer- 
cise great influence in shaping statistical programs in these areas. 
Moreover, they undoubtedly foreshadow similar efforts in the future 
by the Joint Committee in the exercise of its public review and stimu- 
lating functions in the field of economic statistics. 

It is particularly fitting that the American Statistical Association 
take note of and applaud this assumption of a leadership role in the 
field of business and economic statistics by an arm of the Congress. 
This Association has long concerned itself with statistical leadership, 
coordination of statistical programs, and statistical standards; and with 
reference thereto has had a long and intimate relationship with the 
Executive Branch of the Federal Government. 

It was this Association that established the Committee on Govern- 
ment Statistics and Information Services in 1933, whose work led to 
the establishment of the Central Statistical Board by Executive Order 
in 1933, and by statutory enactment in 1935. That Board continued in 
existence until 1939 when its functions were placed in the Bureau of 
the Budget as the Division (now Office) of Statistical Standards. And 
it was a Past President of this Association, Dr. Frederick C. Mills, 
of Columbia University, and a present member of the Council of the 
Association, Dr. Clarence D. Long, of Johns Hopkins University, who 
made a comprehensive review of Federal statistical agencies for the 
first Hoover Commission ard recommended, in part, as follows: 


That there be maintained in the Executive Office of the President an Office 
of Statistical Standards and Services having general responsibility for the 
development and maintenance of a system of statistical intelligence and for 
its economical management. It should have powers necessary to the full 
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coordination of the elements of that system, and for the enforcement of de- 
sired procedures and appropriate standards. This Office should discharge 
che present duties of the Division of Statistical Standards, but should have 
scope for greater initiative and authority appropriate to a broader role. It 
will seek to insure the accuracy, representativeness, timeliness of publica- 
tion, and comparability with other data of all statistical series serving public 
purposes. The work of this Office should be largely confined to supervisory 
and staff functions, with a minimum of routine duties. Its personnel should 
include men of the highest administrative and technical competence. The 
head of the Office shall be appointed by the President. 


Among other functions recommended was that of annually sum- 
marizing and coordinating the operating expenses and appropriation 
requests of statistical agencies and appraising them for the benefit of 
the Bureau of the Budget, the President, and the Congress. Similarly, 
it was recommended that the Committees on Appropriations of the 
House and Senate and other Congressional committees should request 
representatives of the Office of Statistical Standards to testify before 
them concerning appropriations for statistical purpbses and legislation 
involving statistical functions. For the full report, see The Statistical 
Agencies of the Federal Government; A Report to the Commission on 
Organization of the Executive Branch of the Government, National 
Bureau of Economic Research, 1949. 

It is also this Association that maintains, by invitation, an Advisory 
Committee on Statistical Standards to the Bureau of the Budget. 

The leadership role in economic and business statistics assumed by 
the Joint Committee on the Economic Report is a constructive de- 
velopment .. great significance to the economy in general and to sta- 
tistical programs in particular. It must at the same time be noted that 
the Executive Branch of the Government will increasingly find itself in 
an anomalous position if it does not take steps to place the statistical 
coordinating and leadership functions on a plane commensurate with 
their importance in the management of the American economy. Those 
functions were outlined with intelligent comprehension, insight, and 
forward-looking imagination by the first Hoover Commission Task 
Force (Mills-Long) Report I have cited. Although that Report is now 
almost seven years old, I believe a review by the Executive Branch 
of its proper role in statistical coordination and leadership may properly 
start with that Report. 


IV 


The transformations that have taken place in the American economy 
over the past quarter-century and the profound shifts in our thinking 
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that have occurred constitute, in my judgment, one of the great peace- 
ful revolutions of history. It is now taken for granted that we are com- 
mitted, as a people and as a government, to whatever public and pri- 
vate actions may be required to further our progress toward the com- 
panion goals of economic growth and economic stability. 

Since the end of World War II we have experienced two recessions, 
in 1948-49 and 1953-54, and our institutions successfully met the test 
of preventing those recessions from developing into vicious spirais of 
credit liquidation and major depression. That success was, | believe, 
no accident and affords renewed proof of the depth of the transforma- 
tions we have achieved in our economy and in our thinking about the 
economy. The gain to our society represented by these transformations 
is of tremendous significance and is reflected in a pervasive spirit of 
confidence in our future. 

The principal point I wish to stress, however, is that these transfor- 
mations in our economy and in our thinking could not have occurred 
if we had not had a great body of statistical measures of the condition 
and trends of the economy and skilled technicians to analyze those 
measures. The sound management of the American economy is de- 
pendent on good statistical records and on good analysis; and this de- 
pendence extends throughout our society, in private business manage- 
ment as well as in the administration of public functions. 

Both the level and the direction of change of the economy at any 
time are the resu'tants of the myriads of decisions being made every 
day by the spending and producing and governing units that make up 
that economy, including consumers, farmers, business men, labor 
leaders, professional practitioners, and public officials. Each decision 
of that myriad of decisions must be based on information; and it is one 
of the monumental achievements of our society that the measured 
economic intelligence available to our people constitutes a network of 
statistical information unprecedented in scope and richness—howso- 
ever conscious we may be of weaknesses in many of its elements. 

We stress the dependence of good management on statistical records, 
but we are not unmindful of the limitations of our statistical measure- 
ments. These measurements of the condition of the economy can be 
only approximations, for that economy is too massive and too complex 
to afford any basis for claims to perfection or certainty in our economic 
measures. The estimates and indexes that are carefully prepared to 
measure the state of the economy, in terms of such concepts as gross 
national product, national income, industrial production, employment 
and unemployment. represent at any given time the best approxima- 
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tions the technicians can arrive at, on the basis of the information 
available to them at the time and on the basis of the extent of the analy- 
sis that time and funds permitted. These measures are not perfect, they 
cannot represent certainty ; and the best we can claim is that they are 
reasonable approximations at the time and under the given circum- 
stances. A corollary is that our statistical measurements of the state 
of the economy are subject to revision from time to time in the light 
of better information and on the basis of improved analysis. That 
process of revision and refinement is part and parcel of the beckoning 
goal of progress. 

Despite all the limitations in our statistical measures of the economy 
and despite all the human frailties in economic and statistical analysis 
of these measures, the effective management of our economy, public 
and private, must in iarge part rest on them. And the contributions 
made to economic stability over the past quarter century by statistical 
and economic enalysis represent gains which must be measured in 
terms of scores of billions of dollars annually. In social terms these 
gains may be described as the near-universal conviction of our citizen- 
ship that ours is not only a viable economic system but one which in 
large measure achieves the twin goals of security and adventure, of 
stability and growth, of peaceful order and restless dynamism. Here 
indeed is the stuff of progress, the mark of triumph. ; 

The evidence is clear to me that we have wrought transformations 
in the American economy which, given reasonably competent manage- 
ment, make the American economy depression-proof, that is, in terms 
of devastating depressions like that of 1929 to 1933. Through these 
transformations we have achieved in considerable measure an economic 
system of built-in stability. The important point, however, is not that 
professional economists and statistical analysts of the business cycle 
believe that the transformations in the American economy and in our 
thinking about the economy, of which I have spoken, make our econ- 
omy depression-proof against catastrophic depressions. The salient 
point, rather, is that both American consumers and American business 
men generally have come to believe that major depressions are un- 
necessary, that they can be avoided through competent public and 
private management, and that such avoidance is the first domestic 
responsibility of those who manage our affairs. That general convic- 
tion points up a truly sobering responsibility. 

The conviction that catastrophic depressions can be avoided does 
not at all imply a belief that economic recessions are a thing of the 
past. Indeed, we can see t!:at moderate economic fluctuations are part 
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of the price of freedom. They are inherent in a free economy; inhere 
basically in that precious hallmark of a free society—freedom of 
choice; freedom to buy or not to buy; freedom as to what to buy; 
freedom of occupation and freedom of entry into business; and freedom 
in the management of our far-flung business system, composed of more 
than four million separate business concerns. 

The business cycle is still with us and will continue with us so long 
as we remain 9, free economy and a free society, though we may properly 
continue our efforts at lessening the amplitude of economic fluctua- 
tions and may well hope to achieve more and more progress as we learn 
more about the behavior of the economy and as our management skills 
are sharpened by experience. There can be no guarantee, however, 
against recurrent mild recessions and periods of readjustment like 
those of 1924, 1927, 1948-1949, and 1953-1954. But we can take those 
in our stride, precisely because we have learned how to ameliorate 
sorae of the consequences of such recessions at the same time that we 
have learned how to prevent their spira!ling into massive credit liquida- 
tion and deep-dyed depression. 

We may have to face the possibility of the recurrence, at some time 
in the future, of a recession like that of 1938 or even that of 1921; but 
I would say tuat any such contingency would be a sad reflection on 
both public and private management of our economy. Moreover, if 

| such a recession occurred I think it is safe to say that the Administra- 
tion in office at that time would lose the next national election, Con- 
gressional or Presidential. The consciousness of that threat, we may 
: add, is surely a spur to remedial action. 
| The Chairman of the President’s Council of Economic Advisers has 
| wisely observed recently that our economy is at all times subject to 
| the threat of either inflation or recession; and that only through eternal 
vigilance and wise management of the economy can we hope to avert 
those threats. It is the role of economic statistics and quantitative 
economic analysis to supply the indispensable guides that watchful 
vigilance and good management require. It is a not inconsiderable role 
in the evolution of our society. 

















STATISTICIAN AND POLICY MAKER: A PARTNERSHIP 
IN THE MAKING 


Werner Z. Hirscu 
Washington University 


FN 4 recent article, Deming described some of the problems which con- 
4 front the statistician who is giving professiona: evidence in legal 
proceedings.' The statistician’s main difficulty in court is to convince 
laymen of the objectivity and scientific acceptability of sampling re- 
sults. 

In the present paper we would like to refer to some of our experience, 
shared no doubt by many statisticians who act as consultants to busi- 
ness and industry. In this capacity we quite often find it necessary to 
stress the nonobjectivity of statistics to lay executives whose mental 
image of a statistician is that of the detached, impersonal, objective 
scientist. 

While the calculations of the statistician are based on mathematical 
and statistical hws, and the conclusions are therefore scientific and 
our estimates objective, almost everyone will agree that the usefulness 
of any results can be no better than the assumptions underlying them. 
These assumptions, in most cases, are based to a large extent on purely 
subjective value judgments. The statistician thus must explain to 
management what statistics can do, for example, provide estimates 
and decisions in which a known and calculable degree of confidence 
can be placed; and what it usually cannot do, for example, decide 
what degree of confidence would be appropriate in a given situation. 

Let us illustrate the problem by a hypothetical example. A cigarette 
manufacturer wishes to deterraine whether the smoking habits of the 
American public have declined in response to the publicity linking in- 
cidence of lung cancer to cigarette smoking. Management hires a sta- 
tistician to survey a sample of smokers and draw the conclusions. How- 
ever, surprisingly enough, the statistician before getting about his work 
plies the executives with questions. In order to estimate the proper sam- 
ple size, he must know what risks of committing an error of type i and 
of type II the company is willing to take. Furthermore, an agreement 
must be reached as to the interval within which the parameter in que- 
tion (average number of cigarettes smoked per adult after the cancer 
announcement) is expected to lie. These preliminary decisions must 
be made in the light of the expense that goes with getting the sample 
information. 

Those who have faced the problem of obtaining the basic informa- 
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tion needed to estimate the proper sample size have learned that ques- 
tions about a, 8 and the order of magnitude of the parameter must be 
couched in common sense terms. To accomplish this is one of the most 
challenging aspects of the meeting between statistician and executive. 
It is easy to frighten the executive once and for all by making use of 
highly technical terminology or by inability on the part of the statis- 
tician to communicate in simple language. in our hypothetical example 
(and in many actual cases) the statistician wil! explain to the executive 
his procedure of establishing a null hypothesis (“smoking has not de- 
creased”) and an alternative hypothesis (“smoking has decreased”). He 
will make clear to the executive that a sample mean always carries 
some risk of being different from the population mean, and that thus 
there will be, due to the process of sampling, a chance of wrongly ac- 
cepting the null hypothesis and a chance of wrongly rejecting it. 

I am fond of explaining the two kinds of error with the example of 
Bob’s proposal of marriage to Barbara. Barbara can make two mis- 
takes: if she accepts Bob, she may be getting a husband who is wrong 
for her, and if she refuses him she may reject the man who would have 
been the ideal mate. Many executives promptly take to thinking in this 
framework and reply with the example of the jury which may be con- 
demning an innocent man or acquitting a guilty one. This leads to the 
criterion that courts in Anglo-Saxon countries have universally ac- 
cepted, namely that when in doubt, it is better to let the guilty go free 
than convict the innocent. Thus, the courts would usually select a 
smaller 8 than a. 

Ancther example that at present is often brought into the discussion 
at this point, is the decision on the safety of the new Salk-vaccine. If 
the null hypothesis was that there is no live virus in the vaccine, it is 
clear that the probability of declaring a batch of vaccine safe when its 
injection can cause polio must be of the lowest possible order. Yet it is 
also necessary to consider 8 carefully, since it is most undesirable to 
withhold safe vaccine from the public. This example points up the fact 
that the decision about appropriate a’s and §’s depends on the problem 
at hand and on the personal judgment of those facing the decision. 

Once the executive understands the two possible kinds of error, the 
statistician will ask him to consider carefully the consequence of each 
type of error. If the sample survey results in the cigarette smoking ex- 
ample lead to the conclusion that smoking has decreased, on the as- 
sumption that the company’s scientists are thoroughly convinced that 
smoking and cancer are not causally related, what action would man- 
agement take? If it would embark, let us say, on a large-scale adver- 
tising program, would the money expended on this be a total loss to 
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the company in case the conclusions of the survey were erroneous, for 
example in case no change in smoking habits had really occurred? Or, 
would advertising aimed at counteracting the cancer publicity alert 
more people to the fear of an unproved danger? These are some of the 
consequences of committing an error of type I which must be weighed. 
On the other hand, what would happen if from the survey information 
the conclusion is erroneously drawn that the same number of cigarettes 
are smoked on the average as in the previous year? The company would 
sit back and do nothing, at a time when a vigorous advertising program 
could possibly save the situation. The consequences of a “sin of com- 
mission” and a “sin of omission” must be balanced before deciding on 
the appropriate a and @. The statistician must confer extensively with 
management to acquaint it with this framework of thinking and guide 
it to reach appropriate preliminary decisions. Executive and statistician 
will join forces in agreeing on the subjective assumptions and selecting 
corresponding a’s and §’s. Moreover, the executive will be asked to 
furnish a reasonably informed assumption about the range of values 
within which the parameter may be expected to fall. 

About the same holds also where the statistician is called on to make 
confideuce interval estimates. So that a proper sample can be planned, 
statistician and executive wil! have to agree on what precision (width 
of confidence interval) and reliability (level of confidence) they wish 
to get in the estimate. Cost and speed of alternative sampling methods 
and sample sizes will be considered. Jointly, executive and statistician 
will attempt to balance in their minds the risks that accompany wrong 
estimates against sampling cost and sampling speed. 

In all cases there is a need for asking the right questions and for pro- 
viding the best possible answers to them. Asking the right questions 
requires a thorough understanding of statistical methodology; mana- 
gerial experience and mature judgment must provide the answers. Since 
executives with sufficient statistical training are rare, they must have 
the cooperation of the statis*ician. This is especially true of statistical 
decision making where models of alternative actions and outcomes are 
constructed and, on the basis of empirical information, the various ac- 
tions are evaluated in terms of the probability and desirability of their 
outcomes in light of an agreed upon decision criterion. 

In such cases, the statistician will prod the executive to enumerate 
all possible actions open to the company. Such a list will be supple- 
mented by a list of alternative outcomes. There will be cases when the 
executive recalls that a particular action was taken once before but is 
not quite certain as to whether the outcome can be attributed to this 
action or was due primarily to some other circumstance. The statisti- 
cian may wish to get the pertinent data of the case, analyze them, per- 
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haps with the help of a multiple correlation analysis, and advise the 
executive of the results. 

Having completed lists of alternative actions and of alternative out- 
comes, probabilities and desirabilities of the latter are estimated. Once 
more executive and statistician will join forces to decide on what they 
consider are the proper levels of precision and reliability of these esti- 
mates. Under present conditions, especially with our limited knowledge 
of measuring desirability, some of these estimates will not be empirical, 
but will have to be merely intuitive. In making these intuitive evalua- 
tions, the executive will often think aloud. His hunches, his traditonal 
way of thinking and his outlook will go into the selection of that final 
judge, the decision criterion. Much of the company’s philosophy and 
tradition will come to the fore. The executive will state in a general 
way what he considers to be a proper decision criterion, and it will be 
up to the statistician to find a corresponding statistical model, for ex- 
ample, minimaz risk or maximum mathematical expectation model. 

However, it is most important that the cooperation between statis- 
tician and executive does not stop at this point. Most statistical studies 
extend over a period of many months, and in the meantime the execu- 
tive is likely to forget the introductory discussions and lose interest. Of 
course, no detailed recommendations concerning the form of this con- 
tinued cooperation can be given here. The institutional set-up in every 
case will have to be considered. Once the need for cooperation is realized 
by management and research worker appropriate methods are readily 
perfected. It is our experience that in order to retain and rekindle the 
interest of the executive it is desirable to have, as the study progresses, 
a number of meetings in which preliminary results are presented to the 
executive and advice sought on outstanding issues. 

Even in case the partnership between management and statistician 
is perfect in the preliminary stages of the undertaking, once the work 
is completed, usually many months later, and the findings are submit- 
ted, the statistician is generally tempted to retire to the position of a 
detached research worker. Very seldom does he see fit to make policy 
recommendations. This fact is often te reason why the work of the 
statistician is so frequently not implemer-ted, but simply filed away. 
If this happens,the statistician has failed; he has wasted his time and 
the company’s money. 

For this reason, the statistician’s partnership with the management 
should extend further to some final consultations. For the business sta- 
tistician is not an ivory-tower research worker. The hopes, fears, value 
judgments, prejudices and guesses of management shape the assump- 
tions which underlie his work. He himself helped management quantify, 
in as much as possible, these preliminary subjective decisions without 
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which he could never have applied his statistical tools to the problem. 
It is necessary to explain this to management, and it would be highly 
desirable that he should have a chance to state his views of the problem 
when the time comes for the management to consider his conclusions. 
His report should very explicitly explain the assumptions which are in- 
corporated in his findings and what they mean. Having done this, he 
should be free to make a clear-cut recommendation. 

In what institutional aud administrative setting the statistician could 
operate mokt effectively in the company’s decision-making process is a 
problem which needs careful study in each particular case. Once the 
need which this paper is intended to point out becomes clear to man- 
agement (and junior executives with some statistics in their back- 
ground can be very helpful here) it has been our experience that policy 
recommendations included in a statistical report are considered in an 
entirely different light. The problem of the statistician’s place often 
comes to the attention of top management in those cases where his rec- 
ommendations must cut across divisional lines and go beyond the prob- 
lem for which advice was initially sought. If the statistician has done a 
good job of explaining why his work is not abstract and «nnot be de- 
tached but involves the value and business judgments of the executive, 
and if close working ties have been maintained throughout, manage- 
ment will have gained an appropriate point of view from which to con- 
sider the recommendations in light of the initial assumptions on which 
the study was based. Our experience shows that the statistician can be 
most effective in helping in the decision-making process when he is at- 
tached directly to the office of the president or general manager. Other- 
wise jurisdictional difficulties may endanger the implementation of his 
recommendations when they cross divisional lines. 

We are on the threshold of an age when statistical decision making is 
gaining in importance. There appears to be a partnership in the making 
between statistician and policy maker. Whether such a partnership will 
be lasting is not clear at the moment. The statistician, learning more 
and more about the institutional setting of the organization with which 
he is associated, may undergo a metamorphosis which might lead him 
into the policy making field. The executive may acquire more insight 
into statistical techniques. In any case, the need for closer ties between 
managerial ability and knowledge of statistical methodology is becom- 
ing increasingly apparent. The field of activity of the statistician is ex- 
panding. It may well be that this is what Justice Oliver Wendell Holmes 
had in mind, when he told the students of the Boston University Law 
School in 1897: “The black-letter man may be the man of the present, 
but the man of the future is the man of statistics and the master of 
economics.” 











A TEST OF THE ACCURACY OF A PRODUCTION INDEX 


Cuarues F. Carter anp Mary Rosson 
The Queen’s University, Belfast, Northern Ireland 


A comparison of the volume of industrial production in the 
United Kingdom in 1935 and 1948, using the full resources of 
the Census of Production, has recently been published. This 
provides an opportunity of testing the results obtained, much 
more rapidly but on more limited data, by an index of produc- 
tion. The index is shown to have underestimated the growth 
of production, which is in accordance with a priori expecia- 
tions. The divergence between index and Census results for 
particular trades is frequently large; some instances are ex- 
amined. 


N ADVANCED countries, the fullest source of information about the 
i] volume of industrial production is usually to be found in the 
Census of Production. The Board of Trade in the United Kingdom has 
recently prepared [1] tables showing the changes in the industrial pro- 
duction of Great Britain from 1935 to 1948, both at 1935 and at 1948 
“prices.” This work employs, in full detail, the information available 
in the censuses. Even that information gives, of course, only an ap- 
proximate answer—to take one example, where changes in the extent 
of fabrication occur in a trade, information about materials used is 
commonly inadequate to make it possible to calculate a true index of 
“net output at constant prices.” Nevertheless, it is clear that the 
census gives an incomparably better answer to questions about 
changes in the volume of output than any other source. The trouble 
is that it gives that answer only for the years for which full returns on 
physical output are required, and then only several years late: and 
therefore, for the consideration of current trends and fluctuations, we 
have recourse to indexes of production based upon much more limited 
data. But does the limitation of the data introduce a bias? The purpose 
of this note is to test a particular index of production (using every 
known scrap of published information other than the Census) against 
the results obtained from the Census. 

For the months from January 1946 to June 1952, the London and 
Cambridge Economie Service (LCES) computed and published [2] an 
Index of Industrial Production; and an approximate comparison with 
1935 was also given. This Index was terminated in 1952, because it 
was apparent that there would not be any difference in principle be- 
tween the LCES index and the official index computed by the Central 
Statistical Office, which includes certain data not available to private 
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investigators. For the years 1946-1952 the LCES also computed 
annual indexes of the Real National Product [3], the industrial section 
of which was an improvement of the monthly LCES production index 
by taking into account production cata only available annually. The 
Real National Product caleulations also included comparisons with pre- 
war years [4]. 

We have therefore tested the industrial section of the annual index 
against the census comparison of 1935 and 1948 output. This test is a 
severe one, since the period was one of great change and disturbance. 
It must be remembered, too, that the data available to the compilers 
of an index for a comparison with pre-war years are much more scanty 
than those available for purely post-war comparisons. It is likely, 
therefore, that the current official Index of Production will approach 
more nearly to the post-war census reference points—a surmise which 
can be checked later by comparing two full post-war censuses. 

Now an index of production is likely to derive much of its movement 
from the great basic and staple products; it is for these that statistics, 
monthly or annual, are most likely to exist. Minor products will fre- 
quently be unrepresented, and their contribution to net output will be 
assumed to move with some other series. It might be expected that 
staple commodities would be more sluggish than production in general, 
and that an index would therefore be biased downwards in a period of 
expansion; in particular, rapidly growing new products may not have 
been recorded in regular statistics in the first year, even though by the 
second they may have attained substantial importance. 

Some support is provided for this surmise by the data in Table 1. 
There is a preponderance of negative figures (showing that the LCES 
index is teo low) in the last column. In the preparation of that index, 
certain minor trades had to be omitted entirely for lack of data: but 
these omissions are not a major cause ©! divergence, and no systematic 
difference is made in the comparison by omitting the corresponding 
trades from the census. 

Some further light is thrown on the differences by the following dia- 
gram, in which the two results (1948, 1935=100, 1948 weights) are 
plotted against each other for 115 separate trades. The line which 
bisects the angle between the axes is the line of perfect agreement: the 
lines on each of it enclose the points for which the two methods agree 
within 10 per cent, and the outer lines show a divergence of 50 per 
cent of the census figure. The crosses denote the totals of Orders, and 
the cross enclosed by a circle is the grand total—i.e., for industrial 
production excluding building. Sixty-three of the 132 points lie in the 
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TABLE 1 








Index of value added in 1948, taking 1935 = 100. 
Basis for weights—-1948 prices 




















Standard 
a LCES 
“art minus 
7 LCES Census Census, 
Orcer 

as % of 
Census 
Il Mining and quarrying 90.9 87.1 + 4.4 

Ili Treatment of nonmetalliferous 
mining products other than coal 116.3 128.2 — 9.3 
IV Chemicals 146.2 168.9 —13.4 
Vs Metal manufacture 136.1 149.1 — 8.7 
VI__s Engineering, shipbuilding, etc 193.3 184.4 + 4.7 
VII «Vehicles 161.8 171.1 — 5.4 
VIII Metal goods, n.e.s. 134.0 150.8 —11.1 
IX Precision instruments, jewelry 136.1 146.2 — 6.9 
X = Textiles 85.7 94.2 —- 9.1 
XI__—s Leather and fur 81.1 98.4 —-17.6 
XII = Clothing 91.2 78.1 +16.8 
XIII = Food, drink and tobacco 112.2 126.7 —11.4 
XIV Manufactures of wood and cork 88.2 83.2 + 6.0 
XV __s~Paper and printing 86.2 112.4 —23.3 
XVI Other manufacturing industries 166.9 165.2 + 1.0 
XVIII Gas, electricity and water 189.0 174.1 + 8.6 
Torat (excluding building) 121.8 127.6 — 4.5 





range of 10 per cent agreement. It can be seen that there is a distinct 
tendency for the LCES index to underestimate the growth of produc- 
tion: thus, 41 trades are estimated more than 10 per cent too low, and 
only 22 more than 10 per cent too high. The scatter of points also shows 
how unreliable indexes for particular trades can be, even when the 
totals for Orders and the grand total show a more sober stability. 

All these data are based on 1948 prices. It would at this stage be 
an unrewarding labor to compute a complete set of LCES index weights 
appropriate to 1935; but one can get part way by reweighting the 
indexes for Orders, according to the 1935 net outputs. The results are 
shown in Table 2. The large difference made to the census results for 
manufacturing by reweighting should be noted; reweighting by Orders 
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shifts the LCES results about half as much, so that it may be pre- 
sumed that roughly half the difference arises within Orders and trades. 

It may be instructive to examine some of the worst divergences 
between the two results (with 1948 weights) among the individual 
trades. 


(1) Metalliferous mines (Census 65.2, LCES 120.2). The weight in 
the LCES index is given entirely to iron ore. There appears to 
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have been a fall in the proportion of the iron ore total accounted 
for by the higher grades and also a fall in other metals (e.g., 
lead). There is probably also a problem of classifying integrated 
concerns. 

(2) Glass (other than containers) (Census 190.9, LCES 70.8). There 
being no statistics available for flat glass, the LCES index 
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(3) 


(4) 


CY OF A PRODUCTION INDEX 21 


assumed output to be proportional to the rate of housebuilding 
—a poor guess! The trade also includes light bulbs and radio 
tubes. 

Ink (Census 102.1, LCES 42.6). This is classified as “unreliable” 
in the report on the census comparison, [1]. The LCES index 
uses the home consumption of newsprint as a measure of the 
use of printer’s ink. 

Mineral oil refining, and oils and greases (Census 131.5, LCES 
200.4). The LCES index is based on the production of six princi- 
pal items of output, but there is much room for variation in the 
“deg.ee of fabrication” in this trade. 








TABLE 2 
Industrial 
production Manufacturing 
excluding only 
vc hers building (Orders 
(Orders II-XVI, III-XVI) 
XVIII) 
1948 (LCES Index reweighted by Orders) 129.6 127.7 
1948 (Census, 1935 prices) 140.5 140.7 
—and for comparison 
1948 (LCES, 1948 prices) 121.8 123.5 
1948 (Census, 1948 prices) 127.6 131.3 





(5) 


(6) 


(7) 


(8) 


Construcitonal engineering (Census 170.2, LCES 268.1). The 
LCES index uses employment and the production of metal 
teraporary houses; the census calculations fill in what would 
otherwise be a gap by assuming output to move with employ- 
ment alone. 

Motor vehicle repairing (Census 108.2, LCES 192.3). The LCES 
index uses the number of vehicles on the road, with a deduction 
of 20 per cent in 1935 to allow for the lower average age of 
vehicles (and greater ease of repair) at that date. The census 
calculations again fill in a gap by using employment. 

Carts, perambulators, eic. (Census 70.8, LCES 165.6). The 
census figure is inexplicable, the output of the principal prod- 
ucts of this trade having apparently increased. 

Canvas goods and sacks (Census 126.6, LCES 59.3). The LCES 
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index mainly depends on the production and consumption of 
jute yarn, which is evidently not an adequate indicator. 

(9) Textile packing (Census 51.1 on employment, LCES 97.2 on ex- 
ports). Both calculations use imperfect indicators. 

(10) Milk products (Census 140.0, LCES 54.2). The difference ap- 
pears to be due to the inclusion in the census of the sterilizing, 
cooling and bottling of milk. Between 1935 and 1948 cheese 
and condensed milk manufacture fell, but liquid milk con- 
sumption rose substantially. 

(11) Fish curing (Census 69.8, or 68.2 including Ice: LCES 102.6). 
The LCES index uses the landed weight of fish in Great Britain 
as an indicator. The proportion of fish cured has apparently 
dropped, in part because of the fall in exports of cured fish. 

(12) Cattle, dog, and poultry foods (Census 205.1, LCES 87.7). The 
series used in the LCES index (flour for dog biscuits and 
corn and molasses for feeding stuffs) fail to show the changes 
in this industry. 

(13) Starch (Census 162.4, LCES 400.0). The LCES index uses an 
average production figure for 1934-8, which appears not to be 
representative of 1935. 


This list exhausts the differences which exceed 50 per cent of the 
census figure. Another significant one (which accounts for the large 
difference in Order XV) is: 


(14) Newspaper, etc. printing (Census 106.8, LCES 68.5). The dif- 
ference here is a matter of principle: is a 1948 newspaper the 
same prodvct as a 1935 newspaper? The LCES index gave 
half the weight to employment and half to the consumption of 
newsprint (which had fallen heavily)—in an effort to allow for 
the decline in the size of newspapers. 


The large differences appear, as one would expect, mainly in minor 
trades; and some of them are perhaps due to the inevitable approxima- 
tions in using the censuses of production (e.g., in filling in gaps where 
no adequate physical measures of production exist), rather than to 
inadequacies in the index of production. Nevertheless the size of the 
differences may serve as a reminder that the compilers of monthly and 
annual indexes of production can easily go wrong in their choice of the 
best indicators of production; end that conclusions based on such 
indexes for individual trades and industries must be treated with great 
care. The United Kingdom is on the whole well supplied with produc- 
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tion statistics, and this warning would apply with equal force to most 
other countries. 
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ON SIMPLIFICATIONS OF SAMPLING DESIGN THROUGH 
REPLICATION WITH EQUAL PROBABILITIES 
AND WITHOUT STAGES* 


W. Eowarps Dremine 
New York University 


1. GENERAL DESCRIPTION 


FpcrRrose or Tuts Paver. The purpose here is to explain a simpli- 
fied procedure for the selection of a sample and for the numerical 
computation of the standard errors from the returns, with gains in over- 
all statistical efficiency. There is no new theory here: instead, this is a 
synthesis and rearrangement of well-known principles and practices. 
The procedures to be described have been applied here and abroad in 
various kinds of social and economic studies, including estimates of 
acreage and of yield, in marketing research, and in studies of attitudes, 
in program-listening, in the appraisal of buildings and of other physical 
plants, in the testing of industrial materials, and in studies of account- 
ing records. The main features are these: 
1. Rapid, uniform, and valid computation of the estimates and of 
their standard errors through replication of the sampling procedure. 
Added simplicity, under certain conditions, by use of the range. 





* The author expresses with pleasure his indebtedness to Leo P. Crespi and to 
Fred W. Trembour for the privilege of introducing in September, 1952, the plan 
of duplicate samples into the public opinion surveys carried out by the American 
Embassy in Germany (then the U. 8. High Commission in Bonn). In the original 
experimental work, and in earlier and later drafts of the manuscript and of the 
instructions for the procedure, I had the continual assistance of Josephine D. 
Cunningham, without whom this publication would have suffered indefinite 
delay. 

The illustrations and the tables from the sample of Cincinnati are presented 
through the courtesy of the firm O’Brien-Sherwood of New York. Helpful com- 
ments and criticisms of earlier manuscripts came from my friends and colleagues 
F. F. Stephan of Princeton, Tore Dalenius of the Central Bureau of Statistics in 
Stockholm, P. C. Tang now with the Food and Agriculture Organization of the 
United Nations, Octavio A. de Moraes of the Inter-American Statistical Insti- 
tute, and Nathan Keyfitz of the Dominion Bureau of Statistics in Ottawa. I owe 
a special debt of gratitude to Max Bershad and William N. Hurwitz of the Bureau 
of the Census in Washington for their generous assistance, especially on the esti- 
mate of the variance of a ratio and on the variance of the genera! fuxction. 
Finally, I owe much to D. B. Lahiri of the Indian Statistical Institute in Calcutta 
for several opinions, and for his paper cited in the first footnote. 
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2. Paper zones of equal size, which permit the use of equal probabili- 
ties and the theory of single-stage sampling. 

3. The elimination of the complex formulas associated with multi- 
stage sampling and with unequal probabilities. The elimination of the 
further complexity of the formulas associated with 2 or more drawings 
from one primary unit. 

4. Fine stratification by means of the paper zones. 

5. The economy of multi-stage preparation of the sampling units, 
with simplifications and fewer mistakes in preparation. 

6. The elimination of losses in efficiency and of special consideration 
of primary sampling units of extra-large size, or of extra-small size. 
The paper zones distribute the sample with the population—heavy 
where the popuiation is dense, and light where the population is sparse. 

7. Applicability with little modification (with reduced efficiency, of 
course) to conditions where there are not reliable census data. 

8. Complete freedom in the basic design, such as in the size of the 
work-load, in the size of the segment, and in the modes of stratification. 
The basic design may include without complication (a) two-way 
stratification with forced selection of heterogeneous sampling units; 
(b) randomized segments for heavy pressure on & sample of nonre- 
sponses, or for the measurement of the differences between interview- 
ers, different methods of training, alternative questions, etc. 

9. A simplified field-procedure (optional), which permits the deline- 
ation of any economical or convenient size of segment, and if desired, 
immediate selection for coverage. 

The design to be described in this paper is a combination of features 
all well known separately. For example, replicated samples are the 
foundation of the design of experiment. Replication in sampling has 
not hitherto been so common, yet it is the chief contributor to the 
simplicity of the Tukey plan (vide infra). Replication in sampling goes 
back in fact to the early work of Mahalanobis in 1936 in his surveys of 
jute in Bengal.! The key that opens the door to the advantages of 
replication and to equal probabilities, with the efficiency of strata in 








1 P. C. Mahalanobis, “On large-scale sample-surveys,” Philosophical Tr ctions of the Royal So- 
ciety, 231B (1944), 329-451; “Recent experimencs in statistical sampling in the Indian Statistical In- 
stitute,” Journal of the Royai Statistical Society, CIX (1946), 325-78. Recent papers by D. B. Lahiri velate 
more recent experience in the use of multiple drawings, and the elimination of primary units. The refer- 
ences are, “Technical paper No. 5 on the National Sample Survey” (The Department of Economic Af- 
fairs, Ministry of Finance, New Delhi, March, 1954); published also in Sankhya; 14 (1954), 264-316. His 
recommendations are in many ways parallel to those in this paper. 

Hansen, Hurwitz, and Madow describe for an urban area a selection of sampling units that in prepa- 
ration bears some features similar <o the procedures in this paper, and which may therefore be usefully 
consulted in connection with this paper; see their book Sample Survey Methods and Theory (New York: 
John Wiley & Sons, 1953), vol. I, ch, 8, sec. 6. 
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fine classes, and to the theory of the single stage, is the paper zones 
that all contain the same number of work-loads. The combination of 
features to be described here brings within reach a wider use of prob- 
ability sampling by research organizations and by government offices. 

Although the possibility of showing a valid standard error for an 
estimate is by definition a feature of any probability sample,’ it is a 
fact that results of probability samples have toc often appeared in the 
past without standard errors because of the sheer labor of computation. 
This paper offers a solution of the difficulty. 

The sampling unit. In the exposition presented here, the sample will 
be a survey of a human population. The elementary sampling unit will 
be a work-load, which will be transformed into a segment of area, or 
several segments of area, intended to supply to an interviewer an 
evening’s work, or perhaps two evenings’ work, or a week’s work. 

In other applications, not treated here, but involving no new princi- 
ples, the sampling unit may be an acre, or a small plot in a field, a 
dollar of investment, a dollar of income or of expense, a card or 5 suc- 
cessive cards in a file, a line or 5 successive lines in a ledger, one em- 
ployee on the roll, a group of employees, or a test-unit or a test-panel 
in a shipment of materials. 

Measure of size; the Cdu. It is convenient now to invent the symbol 
Cdu’s for the number of dwelling units that an area contained in the 
last census or according to any later information. Thus an area that 
contained 317 dwelling units at the last census and for which we have 
no new information contains now 317 Cdu’s, even though the actual 
(unknown) number of dwelling units in the area today is different. | 

For illustration, a work-load will be 10 Cdu’s. If the census or other 
information is not too far out of date, the work-loads will average about 
10 dwelling units. If there has been a 10 per cent uniform growth, the 
work-loads will average about 11 dwelling units. 

The size suggested here for a work-load only serves for iliustration, 
although it is based on experience. Moreover, the segment to be as- 
signed for canvass need not have the same size as a work-load. It may 
be half as big, or a third as big. In the section “Suggested field-pro- 
cedure for the selection of segments,” the intended size of the segment 
is, for illustration, 5 dwelling units, and the segments are scattered 
about the area that contains the random work-load. The size of the 
work-load and of the segment should be chosen in any proposed survey 
to achieve efiiciency and smooth performance. This paper need not 





2 The term probability sample and its definition appeared first in the author’s book Some Theory 
of Sampling (New York: John Wiley & Sons, 1950), p. 9. 
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treat the problem of what size of work-load is best for any given set 
of conditions, nor what size of segment is best, as these problems are 
already amply treated by theory in books. 

The procedure will be illustrated first under the assumption that 
census figures exist for small areas. It will be illustrated later by a 
sample of Mexico where there are not such data. 

The frame and the zones. We start with a list of census areas, which 
in urban areas may be sections, tracts, or blocks, and which in sub- 
urbs and in rural areas may be sections, enumeration districts, tracts, 
counties, or some other type of area. This list will be the frame.* The 
frame will show for each area listed the number of work-loads therein. 
The accumulated work-loads, area by area, will give a serial number to 
every work-load. Table 3 provides an example. 

The frame assimilates a long string of beads, each bead being a work- 
load. Each bead bears a serial number. Markers, unequally spaced, 
show the end of one census area and the beginning of another. Other 
markers, uniformly spaced, will divide the entire string of beads into 
zones. 

The number of work-loads (beads) in a zone will be determined by 
the average number of dwelling units or of people in a work-load, by 
the number of replications per zone, and by the tota! number of dwell- 
ing units or of people required in the sample. The symbol Z will denote 
the number of work-loads in a zone. Illustrations occur later. 

The boundaries of a zone will cut across blocks, tracts, sections, 
counties, cities, and strata. The boundaries of a zone never show on a 
map. The zone is merely a convenience for assigning equal probabilities 
to the work-loads in a stratum. 

Some remarks on the order of listing the census areas in the frame. The 
order in which the census areas appear in the frame should serve two 
requirements. First, a zensus area that is smaller than a zone (e.g., 
a block, tract, enumeration district) should if convenient go into the 
same zone with other areas that have similar characteristics (e.g., 
average income, occupation), in order to achieve possible gains from 
stratification. In other words, each zone should be as homogeneous as 
possible. Within a zone, however, the order in which the census areas 
appear is immaterial. 

No problem of stratification arises for an area whose size is as big 
as a zone, as it will be in the sample no matter where it appears in the 





* The frame was defined (but not named) by F. F. Stephan in his article, “Practical problems of 
sampling procedures,” American Sociological Review, 1 (1936), 569-80, as a means of access to the uni- 
verse. The term “frame” came from Frank Yates in a meeting of the United Nations’ Sub-Commiasion 
on Statistical Sampling in 1947. 
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frame. Second, the order in which t’ » zones appear in the frame should 
facilitate tabulation. The zone is the smallest possible building-block; 
any area of tabulation must be built up of zones. 

_ For example, in a sample of a regior the frame might commence 
with the eastern portion (e.g., New England); 1st, the central portions 
of the big cities, in order by size; 2nd, the fringes of these cities; 3rd, 
the smaller cities, followed by the counties in the rural parts, in some 
significant order. This order would give strong geographic stratification 
and would permit separate tabulation for the central portions of all the 
big cities in the east, singly or by size-groups; likewise it would permit 
separate tabulations for the fringes of these cities; and for the rural 
part. It might not be the most convenient order for tabulation of an 
entire metropolitan district. Within a city, or within any other area, 
the requirements of stratification will usually be met if the order of 
listing the census areas (e.g., blocks or tracts) is geographic, or in the 
order of appearance in a census table (often geographic), or by rent- 
levels, or both, 

| Replicated drawings. The main feature of the plan is that there will 
be 2 random selections of work-loads from every zone. The 1st random 
number draws a work-load for Sample 1; repetition with a 2nd random 
number draws a work-load for Sample 2. If the sampling design calls 
for 1 segment to a work-load, then we draw the work-loads without 
replacement; we reject a duplicate random number within any zone. 
If the sampling design calls for the creation of 2 or more segments to 
a work-load, not all to be interviewed, then we accept a duplicate 
random number in the drawing of the work-loads, but we draw without 
replacement the segments for interview within the work-load (as we 
can not interview twice in the same households). Thus, the segments 
are drawn as a single-stage sample without replacement. The propor- 
tion of segments in the sample is 2/Z, which explains the finite multi- 
plier 1-2/Z in the variances later on. 

The two sets of work-loads, when interviewed, will bring forth dif- 
ferent results—differeut numbers of dwelling units and different num- 
bers of males, females, and children. The procedure draws blocks, 
tracts, E. D.’s., and ccunties with probabilities in proportion to 
their sizes in Cdu’s, yet because of the replication with equal proba- 
bilities within a zone, tie theory for the estimates and for their stand- 
ard errors is single stage. 

(More than 2 drawings per zone sre permissible and will be treated - 
later. 

Suggested field-procedure for the selection of segments. A work-load 
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once drawn into the sample will fall in a certain census area, perhaps 
a block, or a section, or other area. A field-worker thereupon goes to 
this area and creates segments, following any workable plan. She may 
use the half-open interval,‘ or she may indicate segments on the map 
furnished to her. She may find that the map requires corsiderable 
internal revision. The desired size of a segment may be 10 dwelling 
units, 5, or 4, or even 1, according to specifications. There are several 
methods in common use for creating segments, and there need be no 
elaboration here.’ Detailed maps, directories, and aerial photographs 
are helpful where the delineation of the segments is to be done in the 
office. 

Whatever the intended average size of a segment, there will usually 
be considerable variations in the sizes of the segments, because the 
pr'mary necessity in delineation is clarity of boundary, not uaiformity 
of size. Fortunately, segments need not contain equal numbers of 
people nor of dwelling units. All the dwelling units in a segment have the 
same probability as the segment of coming into the sample at this 
stage; likewise all the households therein, and ali the people. Inequality 
and variability in the sizes of segments introduce no change in the 
probabilities of selection of the people or of the dwelling units therein, 
and no bias. Substantial inequalities in size wll usually cause a negli- 
gible decrease in the precision of a ratio, and scarcely more in the pre- 
cision of an estimate of a total. In any case, in a probability sample, the 
standard errors are calculable. 

The field-worker may thus create segments one by one until she has 
exhausted the area assigned to her. She assigns to each segment a 
serial nuzaber. She may canvass a sample of segments at once if in- 
structed to do so. Random numbers in a sealed envelope, which she 
will use only after she has defined and numbered the segments, will 
draw 1 random segment from each block of ¢ segments. The number c 
is the segment-interval. Thus, if a block had a size of 9 work-loads, 
then the segment-interval would be 9, wiatever be the sizes of the 





4 In the use of the half-open interval, a segment begins with and includes a certain address such as 
No. 46 Varick Street, and extends up to but does no* include some other address such as No. 64, which 
address would form the commencement of the next segment. It is only necessery for the field-worker 
te list each address that will form the commencement of a segment, and to accompany the list oceasion- 
ally with some brief directions or with a rough map so that the content of every segment will be un- 
mistakable, now or ‘ater. The half-open interval was invented but not named by F. F. Stephan in the 
article cited in footnote 3. The name comes from Deming, op. cit., p. 82, and independently from Frank 
Yates, Sampling Methods for Censuses and Surveys (New York: Charles Griffin and Company, 1949), 
p. 67. 

5 Cf. footnote 1, Mahalanobis and Hansen, Hurwitz, and Madow. Cf. also a summary by W. Ed- 
wards Deming, “On the possible types vi pling uuit in the last stage of selection in a probability 
sample,” New York: Advertising Research Foundation, 1955. 
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segments, and the supply of random numbers might 1, 14, 22, 29, 45, 52. 
Random numbers beyond the number of segments actually created 
are blanks and draw no sample. All the dwelling units and all the people 
in the block will thus have the probability 1/9 of coming into the sam- 
ple, and no weighting factor will be required. 

The instructions must contain a special provision to use when the 
field-worker encounters conditions that would cause delay and run up 
the costs excessively were she to create segments in the whole area. 
There, the first step would be to divide the area into portions, ascribe 
to each portion a size in work-loads, and to create segments in only one 
portion which the supervisor will select by the use of random numbers. 
The segment-interval for the selection of the segments for interview in 
this portion must then be reduced in the ratio of the number of work- 
loads in the selected portion to the number of work-loads in the whole 
area, ’ 

There should be an instruction to the field-worker to halt her pro- 
ceedings and to call or write for special advice if she has more segments 
than random numbers: unless her segments are abnormally small, this 
condition may indicate abnormal growth, which will require special 
treatment. 

This procedure has the following advantages: (1) The field-worker 
need not adjust the sizes of the segments in order to produce exactly 
c, 2c, or any other special number of segments within an area. Instead, 
the intended size of the segments may be whatever size appears to be 
best from the standpoint of definition, efficiency, and completeness of 
coverage. (2) If the random numbers select more than one segment from 
an area, the segments selected will be scattered over the area, and there 
will thus be some small gain from the stratification so enforced. (3) 
The creation of the segments, the selection of the sample, and the inter- 
viewing, may take place in one visit, except for recails where there was 
nonresponse. 

The selection of the segments may take place in the office, after the 
field-worker turns in her identification of the segments. This, of course, 
must be the procedure for subsequent surveys. 

This procedure goes smoothly in the field, with simplicity, economy, 
and statistical efficiency. It is nevertheless entirely optional, and is not 
an essential part of the methods of this paper. 

Complete freedom in the basic design. One not only has complete 
liberty in the basic design, but must exercise it. Thus, one may use 
any moce of stratification that he deems to be efficient, by specifying 
the order for listing the census areas in the main frame. One may, if 
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he wishes, use intermediate stratification of a preliminary sample, 
followed by Neyman allocation. For Neyman or disproportionate allo- 
cation, ore merely (e.g.) halves the width of the zoning interval over 
any stratum to double the number of work-loads per 1000 Cdu’s. One 
may meet subsequently the demand for some other size of sample for 
any district within the whole either by altering the zoning interval in 
that district, or by deleting a random work-load in a fixed number, or 
by drawing a supplemental sample. An additional random number per 
zone will produce a 50 per cent increase. 

In a sample of New York, where the zoning interval was 2000 work- 
loads, in order to produce a 25 »er cent supplementation in the bor- 
oughs of Brooklyn and Queens, I drew 2 supplemental random numbers 
between 0001 and 8000 for every 4 zones throughout the two boroughs. 
The interval 8000 was convenient because this number was to be the 
size of the thick zone for tabulation (vide infra). 

The intended sizes of the work-loads, and of the segments as well, 
need not be constant within any zone nor even within any area. Thus, 
in areas that are difficult to carve, or which will be costly for the field- 
workers to reach, one may cut costs and increase tke over-all efficiency 
by deliberately doubling the number of Cdu’s in 2 work-load. The 
zoning interval will still cut off the same number of work-loads, and 
there will be no change in the probabilities, nor in the procedure subse- 
quent. 

One may feel free to choose any formula that appears to be efficient 
for the estimation of any total, proportion, or other characteristic of 
the frame. Whatever be the choice of the form of the estimate, the 
computation of the standard error thereof will be rapid and valid, as 
may be evident from formulas and procedures that appear later. 

Triplicate and quadruplicate drawings from each zone may at times 
be desirable, or even 10 drawings per zone (the Tukey plan). Some rec- 
ommendations on multiple drawings appear later. 

For a national sample of several thousand work-loads, one may easily 
adapt this procedure so as to draw work-loads in clusters, separated 
but 6t too far. This modification should receive consideration in areas 
where travel is very costly, or where there is a desire to place several 
work-loads under one supervisor. It is only necessary to introduce a 
“local frame” of (e.g.) 1000 or 2000 work-loads as the unit of size 
(the bead) in the initial frame; then to draw (e.g.) 4 distinct work- 
loads from each local frame that falls into the sample. The 4 work- 
loads will seldom be separated more than some predictable distance. 
The 4 work-loads may be forced by strutification to come from 4 differ- 
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ent quarters of the local frame. Replication is obtained by drawing a 
2nd local frame with replacement from the same zone. A local frame 
may come into the sample twice. Ii it does, then a work-load of 2 or 
more segments, but not the segments selected therein, may also come 
into the sample twice.® 

It is not difficult to lay out a plan by which to measure the variance 
between interviewers, or the difference between two methods of train- 
ing, or between two questionnaires, or both, by balanced random as- 
signments in successive zones, but this topic does not require elabora- 
tion here. 

For extra pressure on (e.g.) one-third of the nonresponses, one may 
select at random 1 work-load from every successive 3 work-loads in 
Sample 1, and likewise for Sample 2; then weight these res !ts by 3 
and add them to the initial results. Or, one may attempt some other 
type of estimation.’ 

Heavy spotty growth that has taken place here and there since the 
last census will cause the same trouble in this sampling procedure that 
it causes in any other, and it can be handled in the same manner.® 
This problem, like many others, is not within the scope attempted 
here. 

Two-way stratification with forced selection of heterogeneous sampling 
units. One important variation in design is a 2-way stratification which 
forces areas of unlike characteristics to fall together into the sample, a 
device that in some kinds of studies has shown notable increases in 
efficiency. Thus, one may force areas of heavy industry to fall into the 
sample along with areas of light industry, urban areas to fall with rural 
areas, center with fringe, high rent with low rent, etc. The forcing is 
accomplished by the use of two frames, where we used only one hereto- 
fore. The two frames will cover some region of the domain of study 
(a province, or a region, or a city), but they wil! commence with census 
areas that are opposite in character. Thus, one frame might commence 
with the central part of the biggest cities of predominantly heavy 
industry, and the other frame might commence with open country 
sparsely settled. 

In studies of public opinion in Germany, a separation of communities 
into two lists on the basis of religion was effective. One list commenced 





* The theory for the optimum number of segments in a work-load, and for the optimum number of 
work-loads in a cluster, is in Hansen, Hurwitz, and Madow, op. cit., vol. I, p. 291. 

1 W. Edwards Deming, “On a probability mechanism to attain an econoric balance between the 
resultant error of response and the bias of nonresponse,” Journal of the American Statistical Association 
48 (1953), 743-72. 

* Cf. Hansen, Hurwits, and Madow, op. cit., vol. I, p. 351. 
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with communities that were practically 100 per cent Protestant; the 
other list commenced with communities that were practically 100 per 
cent Roman Catholic. Both lists merged at the bottom with communi- 
ties that were about equally divided. 

The work-loads in both lists will commence with serial number 1. 
The two zoning intervals will be equal, and each list will contain the 
same number of work-loads, with blanks at the end if necessary to fill 
out the last zone. However, the average sizes of the work-loads need 
not be equal in the two lists, nor the segments; and in fact their average 
sizes may vary from one zone to another within the same list to accom- 
modate variable costs (as mentioned earlier). 

One random number z draws work-load number z from one list; also 
work-load number z from the other list. Segments drawn as prescribed 
from each of these two work-loads form the Ist sample. Together they 
form one sampling unit, which consists of a dumbbell, with segments 
of opposite characteristics at the two ends. The sampling units are 
thus made heterogeneous, and there is the additional benefit of strati- 
fication. 

A 2nd random number in the same zone draws 2 more work-loads, 
one from each list; and segments drawn from these work-loads form the 
2nd sample, another dumbbell. 

One may form a separate estimate for any region covered by either 
of the twe frames alone, or for any region covered by both frames. 

Further gains will accrue from the use of Masuyama’s zigzag inter- 
val,* whereby random number z betweer 1 and Z’ draws also work-load 
number Z’—z, where Z’ =2Z. 

Thickening the zones for economy and for assistance in computation. 
For speed and economy in the tabulation and computation, and to gain 
increased validity of some of the formulas that we shall use for the 
variances, we shall combine for tabulation several successive initial 
zones to form a “thick” zone. The thickened populatious will go into 
the formulas ahead. The thickened populations will be bigger and less 
variable, relatively, than the populations of the initial zones, and their 
sampling distributions will be more nearly normal than the thinner 
populations of the initial zones. This approach to normality will im- 





® Motosaburo Masuyama, “Recent advances in sampling surveys in Japan,” Bulletin of the Inter- 
national Statistical Institute, xxxiii, part II (1951), 147-52, p. 149 in particular. The use of a heterogene- 
ous primary sampling unit under one supervisor has been basic since 1940 in the sample for the Monthly 
Report on the Labor Force, designed by Morris H. Hansen and colleagues in the Census in 1940; cf. 
Eansen, Hurwitz, aad Madow, op. cit., chapter 12. Another plan for the use of heterogeneous sampling 
units was published by Roe Goodman and Leslie Kish, “Controlled selection, a technique in probability 
sampling,” Journal of the American Statistical Association, 45 (1950), 350-72. 
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prove the estimate of the variance of a ratio, and will also validate the 
use of the range later on. 

In thickening the zones, one should try to meet the following require- 
ments. In a small survey, where there may be only 20 or 30 zones 
initially, the two requirements may be competitive. 


1. The thickness of a zone should be enough to yield a minimum 
y-population of 10, although 5 can be tolerated. (The y-population 
is the number of dwelling units or of interviews or of some other 

| basic population in the denominator of a ratio z/y. In practice, the 
y-population will almost always be far above the minimum.) Thick- 
ness that will yield populations of 10 or more will permit use of the 
range for the standard error of an estimate of this population. 

2. The number of thick zones should be enough to yield a useful 
estimate of the sampling error, but no more. In this way one holds 
to a minimum the costs of tabulation and of computation. In dupli- 
cate drawings, each zone yields one degree of freedom in the estimate 
of a standard error. Triplicate or quadruplicate drawings will give 
more degrees of freedom per unit of tabulation: vide infra “Multiple 
drawings per zone”. 


It is interesting to note that the thickening process retains the sta- 
tistical efficiency of the initial zones. 


2. THEORY 


Procedure for computing the estimates and their standard errors. For 
illustrative purposes we shall deal with 2 drawings per zone. If we use 
the subscripts 1 and 2 for the two samples in Zone i, then the results 
of the interviewers may be summarized as 

Zu,%i2 for the two z-populations (e.g., the z-population might be the 

number of packages of a certain item of food bought last 
week) 

Ya,¥e for the two y-populations (e.g., the y-population might be the 

number of families that bought food of any kind). 


Usuaily we need estimates of: 


A the total z-population in the entire main frame 
B the total y-population in the entire main frame 
@ the ratio A/B 


For example, A might be the total number of packages of a certain 
item of food that the families in a region purchased during the past 
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two weeks, while B is the total number of families in the region that 
bought food of any kind. The symbol ¢ denotes the ratio A/B, the 
average number of packages purchased per family over this time- 
interval for th:s particular region. 

The sample will provide estimates of the results that would have 
been obtained from a complete census over the region, with the same 
questionnaire as was used in the sample, by the same interviewers, 
working under the same instructions and supervisors, during the same 
period of time. For an estimate of the ratio ¢ we may take 


the z-population in the sample 





~ the y-population in the sample 


_ 
wv 


iy 
in which z is the total z-population in the entire sample, both lst and 
2nd samples combined. The symbol! y has a similar definition for the 
total y-population in the sample. 
For an estimate of the variance of f we first define and calculate for 
(thick) Zone i, 


(1) 


Dui = La — Te 
- *\ (2) 
eo Je 
then calculate 
hg = Da — fDy (3) 
and h,?, whereupon we my estimate 
2\12 
Varf ={1-——)— h? 4 
rf ( =) > (4) 


The summation runs over the m thick zones. Z is the number of work- 
loads per zone, and the factor 1—2/Z is the usual finite multiplier for 
the reduction in variance owing to nonreplacement when 2 work- 
loads are drawn per zone. In practice, 1—2/Z is usually replaceable 
by 1. 

Equation 4 is a simple adaptation of the usual approximate formula 


2 1 
CP =(1- —)—(C4+C,? — CC, 5 
; ( =) an + CC) (5) 


for the square of the coefficient of variation of the ratio f. When there 
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are 2 drawings per zone, we may estimate the quantity in parenthesis 
by the summation.” 


2m jm 





1 = Ta — Te Ya — Yr : 
Ear ee 
w y 


1. & 
~~ mE? ee (Dz: — fD,;)? 
Maz 1 


— Feat (6) 
a? 1 

Herein ~ and 7 are the average z- and y-populations per thick zone 
per sample; wherefore s=2m, and f=Z/%. By definition 


by = fC, = (z/y)C; (7) 


whereupon (4) follows at once from (5) and (6). 

Equation 4 achieves a drastic reduction in labor, when compared 
with any valid formula for a standard error in a multi-stage plan. The 
number of thick zones need rot be large. In practice, from 10 to 20 
serve well. 

The above equations apply to any part of the frame over which the 
probability of selection remains constant. If the rate of sampling 
changes from one part of the frame to another, as in Neyman allocation 
of the sample, then each part of the frame requires a separate estimate, 
which the above equations will supply. 

A simple numerical example of a ratio and its standard error. A survey 
of a small urban area gave the results"! shown in Table 1. Five thick 
zones are of course too few to give a good estimate of a standard error, 
but they provide a simple illustration of the use of the formulas. Let 
the number of males be the z-population, and the males plus females 
the y-population. Then the over-all proportion male is 


x 69 + 50 


= — = —_—_ = 49 (8) 
y 127+115 


The values of h are in Tabie 2. In this particular survey, Z was 8; 
hence (4) gives 





10 Hansen, Hurwitz, and Madow, op. cit., ch. 4C and p. 194. William G. Cochran, Sampling Tech- 
niques (New York: John Wiley and Sons, 1953), pp. 115-18. P. V. Sukhatme, Sampling Theory of Sur- 
veys with Applications (Ames: Iowa State College, and the Indian Society of Agricultural Statistics, 
1954), pp. 139-46. Deming, op. cit., ch. 5. 

1 I am indebted to Josephine D. Cunningham for the results of the survey (an experimental one 
conducted in one of my classes at New York University) and to Edith Del Peschio for compiling the 
populations by thick sone in Table 1. 
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Var f (1 i =) 36.2 
” 8 ) (127 + 115)? (9) 
= .000464 


TABLE 1 


THE NUMBER OF MALES AND OF FEMALES BY THICK 
ZONE IN A SURVEY OF AN URBAN AREA 



































| MALB AND FEMALE MALE FEMALE 

THick Sample Sample Sample 
ZONE D; be wee eS ae S D: 

1 2°] 1 2 1 2 
t=1 21 29 -— 8 13 13 0 8 16 -8 
2 30 23 7 15 9 | 6 15 14 1 
3 14 ll 3 +) 5 4 5 6 —1 
4 48 26 22 25 12 13 23 14 9 
5 14 26 —12 7 il —-4 7 15 —-8 
Sum 127 115 12 69 50 19 58 65 —7 

as | 

*D 22 4846228 10.4 S22 . O22 5.4 Ste 222 5.4 

















* Computed without regard to the sign of D;. 


TABLE 2 
CALCULATION OF hi; AND OF h,? 
f =2/y = (69 +50) /(127 +115) = .49 
































From TaB.e 1 he 
Tuick ZONE ie hj? 
Du Dy Dui —fDy 
t=1 0 —8 3.9 15.4 
2 6 7 2.5 6.2 
3 4 3 2.5 6.2 
4 13 22 2.2 4.8 
5 —4 —12 1.9 3.6 
Sum Boh ho Ee 7 LUX 36.2 
whence 


é, =v .000464 = .022 (10) 














38 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


Variance of a direct estimate. The sampling procedure furnishes a 
direct estimate, rarely useful, for the total of any population in the 
frame. Thus, one might wish to estimate the number of dwelling units 
in the entire frame, which might cover a city or the whole country. 
The direct estimate X of the total z-population A in the entire frame 
will be 

X = 4Z2z (11) 
Z being, as before, the number of work-loads per zone. We may then 
use the estimate 


1 2 ™ 
Ver « —() - =) 8°80 12 
“4 =) 2 (12) 


or its equivalent 








(13) 


al eh D.é 
VVar X 2 / X 
Cx = —————_ = "A. -— — 

* 4 Z x 


This estimate is valid regardless of the sizes of the x-populations and 
regardless of their sampling-distributions. One could in fact use thin 
zones for this estimate, even if some of the z-populations were 0. 

Var X is simple to calculate by the above equation, but we may, un- 
der suitable conditions, reduce the labor even further by use of the 
range, which does not require the sum of squares. We merely calculate 
D,, the average of the m values of D,, taken without regard to sign, 
and substitute it into the formula 











1/3 2 / : ‘aR a4) 
abies ieee A cask 
ek ZV2° 1:13 
Expressed as a coefficient of variation this is 
2 /2m D, 
Cy = "A -—- — 15) 
. Z 1.132 


The factor 1.13 is the Tippett constant d, for samples of 2. One may 
calculate the numerical factors in these equations once and use them 
for all the standard errors in the survey: the factors are all constant 
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except D, and z. These equations will give almost the same result as 
the previous one for Var X, even when the sampling distribution of the 
z-population by thick zone departs considerably from the normal. 

For a numerical illustration of a direct estimate and of its standard 
error we may turn back to Table 1 and compute from (11) that 


X = 4Zx = 3 X 8(69 + 50) = 476 (16) 


for the number of males in the area. Table 2 shows the differences D,,. 
Their sum of squares is 237, whence (13) gives 





‘ —F 7 
oye y/} aa 2 = 11¢ 17 
. r 8 69 + 50 70 (17) 


for the coefficient of variation. Or, we may turn to Table 1 for D, = 5.4, 
with which (15) gives 


. J 2 VY10X 54 
ss elt dent” (69 + 50)1.13 11% (18) 

A further illustration of a direct estimate and its standard error 
occurs later in the sample of Cincinnati. 

Multiple drawings per zone. Triplicate drawings in each zone with 
an initial zone 50 per cent wider than the zone required ror duplicate 
drawings will produce the same size of sample but will give a better 
estimate of the standard error, because there will be 33 per cent more 
degrees of freedom. Quadruple drawings with a zone twice as wide as 
the zone required for the duplicate drawings will yield 50 per cent more 
degrees of freedom (vide the table at the end of this section). However, 
the wider zones may under some conditions suffer a slight loss in the 
efficiency of the estimates of f and of X. 

When there are k drawings per zone, instead of 2, equation (11) for 
an estimate of a total will contain the factor 1/k in place of the factor 4. 
Equations 4 and 12 for the standard errors will also require modifica- 
tion. Thus, in place of (4) we now write 


k\k 1 =e ; 
varf= (1-5) ED leu 2) - fs - 90] (19) 


wherein = and § have the same meanings as they did earlier, and 2; 
and 9, are the average populations per sample in thick Zone i. 
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In place of (12) we write 





Var X = (1 - ). =” > (ty — 4,)* (20) 


K(k — 1) i jet 


These two equations take the simple forms of (4) and (12) when there 
are only 2 drawings per zone (k=2). 

On the other hand, calculations with (14) or (15) go as rapidly as 
with 2 drawings per zone, as these equaticns require no modification 
at all except care to use D,,; as the range between all kz-populations in 
Zone i, and the appropriate change in the Tippett constant d., which 
will be 1.69 for a triple drawing, 2.06 for a quadruple drawing, and 
3.08 for 10 drawings. 

A general plan for the variance of any estimate.“ For an estimate other 
than a ratio (x/y) or a total (X) we need a more general theorem. If 
u, and wu, are the two estimates of any characteristic, u, obtained from 
one random half of the full sample, w. from the remaining half, and u 
from both halves combined, then 


‘ 


Var uw = 2 Var u (21) 
and it is a fact that 
Var u = Var (uw — u) (22) 


very nearly. Hence, to estimate Var u one need only draw 1 of the 2 
work-loads at random from every zone, and compute the estimate u; 
therefrom and (u,;—u)?; then to repeat the procedure by drawing 
another random half, then another, and another. Sample 1 constitutes 
one random half; Sample 2 constitutes another, but yields no new 
information. Subsequent halves thus require random drawings from 
the full sample already at hand. The successive values of (u;—w)* are 
correlated, but the cumulated average value of (u—)’ will soon 
settle down to some numbér which we may accept as an estimate of 
Var u. The number of degrees of freedom in this estimated variance 
will be m, the number of thick zones. 

More generally, when there are k work-loads per zone, we may draw 
at random 1 work-load per zone and use the accumulated average of 
(u;—u)* as (k—1) Var u. The number of degrees of freedom will be 
m(k—1). 





2 Cf. the table in Deming, op. cit., p. 570. 
@ T am indebted to my colleagues William N. Hurwitz and Max Bershad of the Census in Washing- 
ton for the privilege of publishing the method described in this section. 
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Use of the range for a general estimate. In place of the general method 
just described, one may often find a much simpler solution through 
use of the range. Let ua and uz be the two estimates per zone of some 
characteristic, and let uw be the estimate obtained from the entire 
sample. Then the formula 


re Dy 


CGC. = ———__— (23) 
1.13uv 2m 


will be satisfactory for the coefficient of variation of u, provided the 
(thickened) populations that enter into the calculation of ua and ue 
are all 10 or bigger. This formula will serve for a ratio, but the use of 
(4) is safer against small y-populations, and is easy to use. 

The Tukey plan. There is a special convenience in some types of 
problems in the use of 10 drawings per zone, to form 10 interpenetrat- 
ing subsamples, which extend over the entire area sampled, consid- 
ered as one thick zone. Tabulations for each of the 10 subsamples will 
give 10 separate estimates of any characteristic, be it a ratio or a total 
or anything else. The 10 subsamples pooled give the final estimate u 
-of the characteristic, and the variance between the 10 separate esti- 
mates u,;(i=1, 2,..., 10) furnishes an estimate of the variance of u, 
with 9 degrees of freedom. This plan goes under the name of the Tukey 
plan,“ and it was used first in 1949.% Systematic subsamples are 
especially convenient in the Tukey plan, but it is better to draw fresh 
random numbers in each zone if there is any possibility of a loss in 
efficiency through serial correlation from zone to zone. 

A quick mental calculation of the standard error in the Tukey plan 
with 10 subsamples may be had by using the following estimates: 











D,: 
=—=+ (24) 
98 10 
a Dy Du 
C= = (25) 
9.8u 10u 
D, is simply the range or the arithmetic difference between the highest 
and the lowest of the 10 estimates u,(i=1, 2,---, 10). The factor 


9.8 is the product of the Tippett constant d,=3.08 for a sample of size 
10, multiplied by 1/10. For a better estimate of the variance, one calcu- 
lates 





4 Described in Deming, op. cit., pp. 99 and 353. 
% W. Edwards Deming, “On the sampling of physical materials,” Revue del’ Institut International de 
Statistique, 1950, 1-23. 
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10 
i— u)? 
10 X 9 “ reais an 

A still quicker solution to the margin of sampling error in u is the 
simple observation made by Mahalanobis that the probability is only 
(4)!°=1/1024 that the maximum of the 10 estimates u; lies below the 
median of all the estimates that are possible by repetition of the same 
sampling procedure. There is an equal probability that the minimum 
of the 10 estimates lies above the median. 

One may use one set of interviewers in a random 5 of the 10 sub- 
samples, and another set of interviewers in the remaining 5 subsamples. 
There are then 8 degrees of freedom for the pure sampling variance, 
1 degree of freedom between sets of interviewers, and 9 in the total 
variance. One may of course randomize the interviewers in all or in a 
part of the area covered by either of the 5 subsamples to obtain more 
detailed information on the variance between interviewers. 

Lahiri in his paper cited in the first footnote gives arguments for the 
use of only 4 or 5 interpenetrating subsamples in a large-scale survey 
whose results will be of general interest, and he presents a further argu- 
ment for presentation of the results by subsample. Briefly, (a) rare 
characteristics have a better chance to appear with sufficient frequency 
to provide a good estimate in each subsample if there are not too many 
subsamples; (b) the degrees of freedom furnished by the subsamples 
will be more effective if the distribution of the estimates by subsamples 
is nearly normal; (c) standard errors estimated from 4 or 5 interpene- 
trating subsamples will be sufficiently precise for most uses; (d) if the 
results ©: the survey are presented by subsample, the user may estimate 
the standard érror or any function that he may wish to calculate from 
the populations so presented; (e) the results of 4 or 5 subsamples will 
be far less bulky in publication than the results of 10 subsamples. 
Lahiri’s paper should be read for its illumination of a host of other 
practical problems in the preparation and use of sampling surveys. 

One may compare in the table telow the number of degrees of free- 





Varu = 














4 . Replica- Number of | Degrees of 
Number of thick | tions per | tabulations | freedom df:tab 
— zone required (df) 
10 2 20 10 -50 
7 3 21 14 .67 
5 4 20 15 -75 
1 (Tukey plan) k k k-1 1—1/k 
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dom in the estimates of the standard errors against the number of 
separate tabulations required for different plans. Decision or: the best 
plan will depend on convenience, and partly on this comparison, with 
consideration of possible losses from wide strata. 

To gain experience with a new material, one may in the first survey 
use duplicate drawings, and compare the average variance between the 
two samples in adjacent thin zones with the average variance within 
thin zones. If this ratio lies between 1 and 1.05, there would be but 
little loss in doubling the size o, the thin zones and using quadruple 
drawings in the next survey of this material. 

I may say, however, that in my own practice, I have used either just 
2 drawings per zone, or 10 with one thick zone for tabulation (the 
Tukey plan). 

In connection with rare characteristics, it is interesting to note from 
some recent work by Jones® that the interpenetrating subsamples will 
still give valid estimates and valid standa-d errors so long as two or 
more of the subsamples pick up members v. the universe that possess 
the specified characteristics. Of course, if some populations are rare or 
absent in one or more subsamples, the interpretation of the standard 
error must be made with the aid of the proper theory for skewed dis- 
tributions. 


3. ILLUSTRATIVE EXAMPLE: A SAMPLE OF A DISTRICT 
A. A Sample of the Cincinnati Area 


Preliminary calculations. The district for the first illustration will be 
the Cincinnati area, composed of the cities of Cinzinnati and Coving- 
ton, and the remainder of the counties Hamilton and Kenton, plus 
Campbell County adjacent. The aim of the study was to compare 
readers’ opinions on two special features of a certain newspaper. The 
general plan outlined earlier was broken into two parts, the Ist part 
for the “block cities”,'’ and the 2nd part for the remainder of the area. 
The reason for the split is that the firm that carried out the study 
could carry out the Ist part in their own office, as the only materials 
required therefor were the block statistics published by the Census 





% Howard L. Jones, “On the lower moments of the mean of a Tukey sample,” Chicago Illinois Bell 
Telephone Co., 1955. “Investigating the properties of a sample mean by employing random subsample 
means,” this issue. 

17 A “block city” is a city for which the Bureau of the Census publishes “block statistics.” To be 
eligible, the city must have contained 50,000 or more inhabitants in the preceding census. A block in 
America is the smallest area that is bounded by streets. The block statistics show for every block city 
the number of occupied dwelling units and many other usefui figures for every block that had occupied 
dwelling units in the last census. In this illustration, the block cities in the sample were Cincinnati and 
Covington. 
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for the two block cities in the survey, plus maps or directories as further 
aids, all of which one can purchase readily. In contrast, the 2nd part 
makes use of figures and maps that only the Census possesses. 

The area to be surveyed contained 276,600 Cdu’s (1950), of which 
179,139 Cdu’s or 64.8 per cent were in the block cities Cincinnati and 
Covington. The number of dwelling units in the sample was to be 
about 900. A work-load was defined as 10 Cdu’s, and 2 work-loads were 
to be drawn from each zone. Thus, 45 zones should yield a sample of 
900 Cdu’s; or, owing to growth since 1950, something over 900 occupied 
dwelling units today. To decide the zoning interval, we note that 
276,000 Cdu’s will give 27,600 work-loads, and that 27,600/45 =613. 
The zoning interval actually adopted in tue face of possible growth 
since the Census of 1950, was 630 work-loads. 

Instructions for the part in the block cities."* The following instruc- 
tions apply to the block cities Cincinnati and Covington. 


1. Prepare a list of the tracts in the order shown in the Census 
statistics for tracts, and show for each tract the number of work- 
loads therein. Form the accumulated total work-loads tract by tract. 
The accumulated totals will ascribe a serial number to every work- 
load in the two block cities (Table 3). 

2. Draw 2 random numbers for each of the 29 zones, one for the 
lst sample, and one for the 2nd sample. The zoning interval will be 
630 work-loads. Record these numbers in two columns in the order 
drawn (Table 4). Every random number will identify by serial 
number a certain work-load for the sample; also the tract and the 
block in which it lies. 

3. Identify the blocks that contain the sample of work-loads. To 
this end, for every tract that was struck by the random numbers: 

a. Prepare a list of the blocks therein, in the order shown by the Cen- 
sus block-statistics. Show for each biock the number of Cdu’s therein and 
assign to each block a number of work-loads. 

b. First, tie any block of less than 20 Cdu’s to another adjacent, and 
assign a size to the pair.’* It may occasionally be necessary to tie 3 or more 
biocks together. Tie likewise blocks that had size 0, as they may now be 
occupied (vide Table 5 for an example). 

ec. Force the total number of work-loads accumulated for the blocks 
within a tract to agree with the size ascribed to that tract in Step 1. Do 
the forcing by adding or subtracting a work-load from the biggest block or 
combination. 





18 These are the actual instructions to the firm. They are printed here only for illustration of the 
theory and principles, and not as patterns suitable without modification for cther surveys. 

19 Step 3b provided enough segments for a second survey, to be taken a few months later. This step 
may be omitted if there will be no further surveys in the area. 
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TABLE 3 


SERIAL NUMBERS OF THE WORK-LOADS BY TRACT IN 
THE BLOCK CITIES CINCINNATI AND COVINGTON 








‘ NumBeror| Serrar NuMBERS 
Ciry Tract CDU’s Work- OF THB 

(1950) Loaps Worx-Loaps 
Cincinnati 1 2019 202 1— 202 
2 2334 233 203- 435 
3 2729 273 436—- 708 
4 2861 286 709—- 994 
5 4577 456 995- 1450 
6 1461 146 1451— 1596 
7 1705 170 1597— 1766 
8 1473 147 1767— 1913 
9 2724 272 1914— 2185 
10 2221 222 2186-— 2407 
107 ot 75 15754-15828 

108 175 
109 87} 26 15829-15854 
110 486 49 15855-15903 

Ward 

Covingtoa 1 2316 232 15904-16135 
2 1829 183 16136-16318 
3 2080 208 16319-16526 
4 2066 207 16527-16733 
5 6222 622 16734-17355 
6 5508 551 17256-17906 
*Blanks 364 17907--18270 

















* Blanks added to fill out the sone, to keep the zones for the block cities clean of the surrounding 
area, for ease in tabulation. 


4. Draw a random number between 1 and the total number of 
work-loads in this tract, to locate the block or combination in which 
the work-load falls. Make a list of these blocks or combinations, 
and show how many work-loads were ascribed to each one in Step 3c. 

In a big tract, it may save time to form groups of 5 successive blocks, 


and to show the detail block by block only for the group struck by the ran- 
dom number. 
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TAELE 4 


Tae Samp_e or Loca Frames In Tue Buiock Cities Cincinnati AND Cov- 

INGTON, Drawn By Use or Kenpautt ANp Smitn’s Random Numbers, 41TH 

TuHovusanp, Cots. 5, 6, 7, Ling 15. Each Ranpom Numser From Tue Boox 

Lies BETWEEN 000 anv 629, ANp Is Transitatep By AppitT1on To Tue Lert- 
Hanp Bounpary Or Tue Zone 








Ranpdom DrawIncs 




















ZONE | BouNDARIES OF 
(Thin) Fon Seaton From the book Translated 
Sample 1 | Sample 2} Sample 1 | Sample 2 
1 00001-00630 401 211 402 212 
2 00631-01260 122 087 753 718 
3 01261-01890 222 559 1483 1820 
4 01891-02520 457 424 2348 2315 
5 02521-03150 {| 211 012 2732 2533 
etc. | 











5. In each block or combination, create segments by the pre- 
scribed rules.*° 

6. Draw one segment at random from every successive c segments, 
and canvass it. (c is the number of work-loads in the block.) This 
the field-worker may do on the spot, immediately after she finishes 
the job of delineating the segments. 

Tables 3 and 4 illustrate the above steps. 

Instructions for the area outside the block cities. This part of the work 
was carried out by the Census, down through Step 10. The instruc- 
tions to the Census follow (numbered continuously from the previous 
steps, for convenience in-reference). 

7. Prepare a list of the enumeration districts (hereafter E.D.’s.) 
in the three counties outside the block cities, in any order convenient. 
Show for each E.D. the number of work -loads therein, and form the 
accum’ lated totals. Tie an E.D. of less than 40 work-loads to an 
E.D. that is nearby on the Census list, and ascribe a number of work- 
loads to the combination. It may occasionally be necessary to tie 
3 E.D.’s. in one combination. The accumulated totals will ascribe 
a serial number to every work-load. 

8. In areas where there have been special censuses since 1950, 
without change of boundary, the Census will use the new fig- 
ures. 





2 Cf. the section in Part I entitled, “Suggested field-procedure for the selection of segments.” 
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TABLE 5 


Ranpom NumsBer 1820 In Taste 4 Struck Tract 8. Tuts Tass Soows Tue 

Biocks Or Tract 8, AND Tue Serta, Numpers Or Toe 147 Worx-Loaps 

Tuerein. A Ranoom Drawine Between 1 Anp 147 Se_ecrep A Worx-Loap 
From Tue ComBInaTIon Or Biocks 5 ANp 6 For Tue Samp_eE 














, NuMBER OF | Serta, NUMBERS OF 
Taace Bioce CDU"s Work-Loaps | Tae Worx-Loaps 
S 1 24 Se 
> ~ 10 1767-1776 
3 101 10 1777-1786 
4 121 12 1787-1798 
5 77 
r as 9 1799-1807 
7 85 8 1808-1815 
8 118 12 1816-1827 
9 91 9 1828-1836 
10 23 
“7 vot 6 1837-1842 
12 114 ll 1843-1853 
13 55 6 1854-1859 
14 98) 
15, 16 p } 10 1860-1869 
17 99 10 1870-1879 
18 61 6 1880-1885 
19 96 
| 20 0 10 1886-1895 
21 26 
9 
93 i 5 1896-1900 
22 132 13 1901-1913 
Total 147 147 ° 

















9. Draw 2 random numbers for each of the 16 zones outside the 
block cities, one for the 1st sample, and one for the 2nd sample. The 
zoning interval will be 630 work-loads. Record these numbers in 2 
columns in the order drawn. Every random number will identify by 
serial number a certain work-load; also the E.D. (or a combination 
of 2 or of 3 E.D.’s) in which this work load falls. The Census will 
furnish a map and a description of each of these E.D.’s. or combina- 
tion, together with the figure that in Step 7 prescribed the number 
of work-loads therein. 

10. As an alternative to Step 7 the Census may, for economy, use 
2 stages. The first stage might be to list groups of 5 or 10 E.D’s. 
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and to show for each group the number of work-loads. Within any 
group that is struck by a random number, it will then be necessary 
to ascribe a number of work-loads to every E.D. therein and then 
to force the total number of work-loads for the group to agree with 
the original measure of size assigned to it. Do the forcing by adding 
or subtracting a work-load from the biggest E.D. or combination. 

11-12. The firm will now delineate segments in the E.D’s. in 
which the work-loads fell, and will draw segments by random num- 
bers for canvass. The work follows Steps 5 and 6 and need not be 
written out here. 


B. Numerica! Calculations for the Cincinnati Area 


Some numerical results for Cincinnati. Table 3 shows the number of 
Cdu’s in the tracts of Cincinnati and of Covington, the number of 
work-loads ascribed to each tract, and the serial numbers of these 
work-loads. 

Table 4 shows the random numbers for these cities (Step 2). The 
work-loads that bear these serial numbers belong to the sample. There 
was a similar set of random numbers for the area outside the block 
cities (Step 9). 

Comparison with Table 3 shows which tracts contained the work- 
loads in the sample. For example, work-load number 1820 lies in tract 
number 8. Next comes Step 3. One may turn to Hansen, Hurwitz, 
and Madow, op. cit., pp. 248-52, to see an example that is similar; 
nevertheless, I give in Table 5 the detail in Tract 8, for the convenience 
of the reader. The sum of the work-loads block by block turned out to 
be 147, without any forcing. In Step 4 we draw a random number be- 
tween 1 and 147: this random number turned out to be 33, which drew 
work-load number 1799=1766+33, whicl. draws the combination of 
blocks 5 and 6. Steps 5 and 6 need no description here. 

The results for several characteristics for the two samples over the 
entire Cincinnati area are in Table 6. From this table we see that the 
total number of households in the sample is (449+-454), which substi- 
tuted into (11) gives 


X = } X 630(449 + 454) = 284,445 (27) 


for an estimate of the total number of households in the area in June 
1954. This figure compares with the number 276,000 in the census of 
1950. For the standard error of this estimate, we note that the average 
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TABLE 6 


SUMMARY FOR THE 2 SAMPLES OVER THE ENTIRE 
CINCINNATI AREA 








CHARACTERISTIC Samp_e 1 SaMpLeE 2 

Number of dwelling units encountered* 449 454 
Number of individual persons 

Encountered t 452 460 

Refused 41 42 

On vacation 31 19 

Not found ;f{ deaf, sick 34 37 
Households not qualified (do not receive the 

particular newspaper studied) 104 98 
Households qualified 242 264 





* This is the z-population used in (27). 

+ These are the interviews attempted. The rule was to interview all the people in families of 1 per- 
son; 1 at random from each family of 2 persons; 1 at random from every 3 persons in families of 3 or 
more persons. The final results for any household were then weighted inversely by the number of per- 
sons interviewed thurein. 

t Not found in 6 recalls. 


difference D, from Table 7 is 7.9, whence (15) gives 


- V2X9 X 7.9 


é = 33 28 
*  1,13(449 + 454) 7% (28) 





for the coefficient of variation of the estimate X. Equation 13 gives 
3.2 per cent. This agreement between (15) and (13) was predicted and 
is typical. The standard errors of any other characteristics are com- 
putable likewise, but I shall not show a further example here, except to 
add that the time required for the computation of 6 standard errors 
by (15) was about 15 minutes with a slide rule, once the figures by thick 
zones came to hand. 

There was no randomization of interviewers here; hence (as in most 
surveys) the differences between the Ist and 2nd samples in Tables 6 
and 7, and also the standard errors computed above, are not purely 
errors of sampling, but are a combination of the uncertainty introduced 
by sampling and of the differences between interviewers. If the same 
interviewer were assigned to both segments ir any zone, then the 
standard errors as calculated would measure the sampling errors, 
plus the random component of response, not confounded with the dif- 
ferences between the interviewers. If 2 interviewers were assigned at 
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TABLE 7 


TotaL Dwe.iinc Units By Tuicx Zonres By Suspsamp.e, In Tue Survey or 
CrincinNAtTI. Toe DirreRENcEs AND THErR Squares ARE For Use 
Iw (15) AnD (13) For Tue Sranparp Errors 


* 


























Tuick ZonE SAMPLE 1 SaMPLe 2 D; D;? 
1-5 51 45 6 36 
6-10 55 40 15 225 

11-15 50 47 3 9 
16-20 37 56 —19 361 
21-25 49 60 —11 121 
26-30 40 42 —2 4 
31-35 52 48 4 16 
36-40 46 52 —6 36 
41-45 69 64 5 25 
Total | 449 454 71" 833 

Average per 

thick zone | 49.9 50.4 7.9* 92.6 








* Computed without regard to the sign of Dj. 


random in 2 zones, it would be possible to compute the variance be- 
tween interviewers as well as the random errors.” 

Comparison with the Poisson variance. It is interesting to note that 
the precision delivered by the sampling plan here for the estimate of 
the number of dwelling units in the Cincinnati area is remarkably 
close to the precision that would have arisen from work-loads whose 
sizes followed a Poisson distribution. Thus, from Table 6, the average 
size of a segment in actual dwelling units encountered was 3(449+454) 
/45=10. If the distribution of sizes were Poisson-like, the variance of 
this distribution would alse be 10, and its coefficient of variation would 
be 1/4/10. The coefficient of variation of the mean of a sample of 90 
segments would then be 1/+/900 or 3.3 per cent, in remarkable sgree- 
ment with (28). The interpretation is that the sizes of the segments 
turned out to have a distribution approximately like a Poisson variate, 
partly as a result of the rule that clear and definite boundaries of a 
segment take precedence over equality of size, and partly as a result 





™ This is the fundamental idea in Mahalanobis's system cf interpenetrating samples; see the refe>- 
ences to his works in the first footnote. See also Hansen, Hurwitz, and Madow, op. cit., vol. II, ch. 12, 
sec. 3. A pletely orthogonal! application occurs in the author’s paper, “On the sampling of physi- 





cal materials,” Revue de l'Institut International de Statistique, 1950, pp. 1-23. 
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of uneven changes in population since the last census.** Under such 
conditions the equation 


1 
Cx = es (29) 


sets a limit to the precision attainable in an estimate (X) of a total. 
For any characteristic that shows door-to-door correlation, the vari- 
ance will of course be higher. With special effort to acquire recent 
figures on growth and changes in the populations of areas, it may be 
possible to create work-loads of more uniform size, and in recurring. 
surveys it may pay to do so, especially if an estimate of a total is im- 
portant. An estimate (f) of a proportion is usually not sensitive to the 
variation in the size of the work-load. 


4. A NATIONAL SAMPLE 


Preliminary calculations. In connection with another project, the 
number of work-loads in a national sample was to be about 500, and 
the number of zones about 250. The number of Cdu’s, in the entire 
country from the Census of 1950 is about 43,000,000. A convenient 
zoning interval was 17,500, derived from 4,300,000/250= 17,200. 
Because of growth since 1950, the yield in dwelling units of such a 
sample, with an average of 10 dwelling units per work-load, wes well 
over 5000. 

Instructions for the block cities.* 

13. Prepare a list to show the block cities by geographic region 
(e.g., New England). List the cities in geographic order within each 
region, east to west, south to north within size-groups (over a million 
inhabitants, 250,000 to a million, under 250,000).% Opposite each 
city show the number of work-loads therein. The accumulated totals 
will give a serial number to every work-load. Continue the accumula- 
tion from one region to another. 

14. Draw 2 random numbers in every zone. (The rest of this para- 
graph follows Step 2.) 

15. It remains now to identify the sample blocks. Te this end, for 





” My friend F. F. Stephan of Princeton kindly pointed out this useful and interesting observation. 

® For convenience in reference, the numbers assigned to the steps in this section are continuous with 
the previous steps. 

% This order will give good statistical efficiency for most purposes, and it will facilitate tabulations 
by size of city. One may specify some other order for listing the cities if he prefers. The stratificution, like 
many other parts of these instructions, should be altered to achieve the best statistical efficiency for the 
purposes at hand. 
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every city that was struck by the random numbers, identify the 

page in the statistics for tracts. This can be done by accumulating the 

number of work-loads for each page or for a whole group of pages for 
any city. First force the total number of work-loads assigned to this 

city in Step 13. 

16. Identify the particular tract on the page that was identified 
in the preceding step. First force the total number of work-loads 
accumulated by tract to agree with the number assigned to this page. 

17. Identify the blocks that contain the sample of work-loads, as 
in Step 4. 

18 and 19. Same as Steps 5 and 6. 

Instructions for the area outside the block cities. As in the sample for 
Cincinnati, this part was carried out by the Census. The strata were 
those defined by the Census for the Monthly Report on the Labor 
Force.* The steps were otherwise similar to Steps 7-12 for Cincinnati. 


5. SAMPLING WITEOUT STATISTICS FOR SMALL AREAS 


A sample of the Federal District of Mexico.** This area showed 604,000 
dwelling units in the Census of 1950. There are totals for each section, 
but not for the blocks within the sections. The feature of special interest 
here, different from the previous illustration, is that within any section 


we assign to each block the average number of dwelling units. 

Another feature is that the city has been growing at the rate of 6 per 
cent annually; hence, it seemed desirable to increase arbitrarily by 50 
per cent the sizes of the blocks of some of the outlyiug sections wherein 
the growth appears to be most prominent. This decision i'lustrates 
the fact that the number of Cdu’s in an area is not necessarily a figure 
published by the Census; it may instead represent one’s best informa- 
tion. 

Lack of census information for small areas does not alter the proba- 
bility of selection nor introduce bias; it only subtracts from the effi- 
ciency that would be possible ctherwise, and it adds a few problems to 
the fieldwork, owing to the fact that the work-loads assigned for inter- 
view will usually be more variable. 

This study was to be an investigation on the cost of living. An inter- 
viewer could cover an average of 5 or 6 families in one day. The possible 
adjustments in the tasks of the interviewers that sometimes arise from 





% Had there not been already in existence a suitable mode of stratification, it would have been 
necessary to prescribe one. 

* I am indebted to Ana Marfa Flores, Chief of the Department of Sampling in the Census of Mex- 
ico, for the data in Table 8 and for the privilege of working with her on this survey; likewise to José 
Nieto de Pascual in the same department. 
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inequalities in the sizes of the work-loads would not be serious here, 
because a small work-load in any area would likely be counterbalanced 
by a big one not far away. It appeared advisable, therefore, in the 
interest of simplicity of preparation and administration, to assign to 
each block within a section a designated number of work-loads, this 
number so chosen that it produced an average size of 5 dwelling units 
per work-load over the whole section. The plan for Mexico thus con- 
forms otherwise to the plan for Cincinnati. Table 8 shows the start 


TABLE 8 


THE COMMENCEMENT OF THE TABLE OF SERIAL NUMBERS 
FOR THE SAMPLE OF WORK-LOADS IN MEXICO CITY 














ESTRATA 1 
Average | Number of work-loads 
number Serial numbers 
Section Number of Cdu’s In the of these 
of blocks per block Per whole work-loads 
1951 block | section 
1 4 140 28 112 0001-0112 
2 11 127 25 275 0113-0387 
3 16 126 25 400 0388-0787 
4 17 100 20 340 0788-1127 
5 8 145 29 232 1128-1359 




















of the assignment of serial numbers to the work-loads in Estrata 1, a 
portion of Mexico City. 

One may also, in the absence of census data for small areas (such 
as by blocks), make a quick tour of a city or other area and estimate 
very roughly the number of dwelling units in each small area, then 
proceed according to the instructions for the survey of Cincinnati. 











INVESTIGATING THE PROPERTIES OF A SAMPLE MEAN 
BY EMPLOYING RANDOM SUBSAMPLE MEANS 


Howarp L. Jonzs 
Illinois Bell Telephone Company 


1. INTRODUCTION 


SAMPLING procedure consisting in the selection of systematic sub- 
A samples with multiple random starts was briefly described by 
Madow and Madow [8, pp. 8, 9] in 1944. Several telephone companies 
in the Bell System have made considerable use of similar procedures, | 
which were suggested by J. W. Tukey in connection with sample 
surveys to determine the per cent condition of telephone property. 
This general method of sampling, with the xumber of subsamples nearly 
always equal to 10, has been used by W. E. Deming for surveys of this 
kind in Illinois and other states. Some applications, and their relative 
advantages, are discussed by Deming in his article “On Simplifications 
of Sampling Design Through Replication with Equal Probabilities 
and without Stages”, this issue. 

To design a sampling plan of this type, we first assign a serial number 
to every item in the population. A counting interval is then computed, 
the formula for 10 subsamples being 


k = 10N/n, (1.1) 


where N is the size of the population, n is the desired number in the 
sample, and k is the required counting interval. The next step is to 
divide the population into slices, with the first k serial numbers in the 
first slice, the second k numbers in the second slice, and so on. Next 
we select 10 sample serial numbers, without replacement, from each 
slice. For the first slice, the 10 serial numbers are selected by employ- 
ing a table of random numbers. For the remaining slices, we choose 
one or the other of two procedures. 

The procedure originally proposed by Tukey, here designated as 
Method A, is to add multiples of k to the serial numbers of the sample 
items in the first slice. Suppose we let a, b, c, ---, 7 represent the 
random serial numbers in the first slice. Then the sample serial numbers 
in the second slice will be (a+k), (b+k), (e+k),---, (7+); the 
sample serial numbers in the third slice will be (a+2k), (b+2k), 
(c+2k), - ++, (j+2k); and so on. After the sample items correspond- 
ing to these serial numbers have been selected, they are combined into 


10 subsamples, the first subsample consisting of items numbered 
; 
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a, (a+k), (a+2k), ---+; the second subsample consisting of items 
numbered b, (b-+k), (b+2k), - - - ; the tenth consisting of items num- 
bered j, (j+k), (j+2k),--+. Ii each serial number from 1 to N 


corresponds to an item in tae population, eech of the 10 subsamples 
will include one sample item from each of the slices into which the 
population has been divided, except for a possible slight variation in 
the last slice. 

The alternative procedure, here designated as Method B, has been 
used rather extensively by Deming. It is like Method A in that each 
subsample contains one serial number from each slice. But the 10 sam- 
ple numbers for each slice are selected at random, without replacement, 
and not by adding multiples of k to the serial numbers in the first slice. 
These 10 sample serial numbers are assigned to subsamples in order 
of random selection. The relative advantages of Method A and Method 
B will be discussed in section 4. 

For either procedure, the mean of each subsample is computed, and 
the over-all mean is found by averaging the subsample means, thus: 


1 
#=— > z,, r=a,b,---,j, (1.2) 


where z, denotes the mean of the rth subsample, Z is the over-all 
sample mean, and g is the number of subsamples. As an estimate of the 
mean of the population, ~ is unbiased if N is an even multiple of k. 
In any case, the variance of ~ can be estimated by employing the for- 
mula 


63? Zp — £)? 

; negate . (1.3) 
where X is a constant such that é,*, the estimated variance of 2, is 
unbiased. A constant of this kind, with a value that is equal to one in 
sampling with replacement and that approaches one when the sampling 
fraction is small, is called [5, p. 103] a finite multiplier. 

For the general case where there are g subsamples, the value of the 
finite multiplier for Method A has been given elsewhere [7, p. 33, 
equation (40) ], without proof, as 


h = (N — n)/N. (1.4) 
If N is an even multiple of k, this ratio may be written as 


\ = (k — g)/k. (1.5) 
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It will be shown in sections 2 and 3 that this value of the finite mul- 
tiplier applies to both Methods A and B. 

Stratified sampling, with different sampling fractions for different 
strata, is frequently employed in designing sampling plans where the 
sample items in each stratum are selected by either Method A or 
Method B. Differences between strata in dispersion and unit sampling 
costs are taken into account in making the optimum allocation of sam- 
ple sizes among these strata. The usual procedure is to make the num- 
ber of subsamples, g, the same for every stratum, but to vary the 
counting interval, k;, from stratum to stratum so that the ratio 
g/k; is the desired sampling fraction. Samples of this type, where differ- 
ent sets of random numbers are used in selecting the items from dif- 
ferent strata, are discussed briefly in section 5. 

In many sampling problems, it is desirable to assign serial numbers 
or to employ a sampling frame so that the number of units brought 
into the sample is a random variable, and the average of the sample 
zx’s is therefore the ratio of one random variable to another. A ratio 
of this kind, used as an estimate of the corresponding ratio for the 
entire population, is called a ratio-estimate. In some sampling studies, 
it is desirable to compute a weighted average of several ratio-estimates 
that are not statistically independent, as when we wish to determine 
the weighted average of the per cent condition of two or more related 
classes of telep:ione property where units of these different classes are 
brought into the sample through the selection of common sample 
locations each of which may include one or more units of these classes. 
Formulas for approximating the bias and variance of such weighted 
average, when either Method A or Method B is used to select the 
sample locations, will be derived in section 6. The situations to which 
these formulas apply should not be confused with those discussed in 
section 5 where the samples for different classes or strata are selected 
by employing different sets of random numbers and are therefore 
statistically independent. 

A situation where there is just one class of items, and therefore just 
one ratio-estimate, may be regarded as a special case among the situa- 
tions discussed in section 6. It includes the situation where Methoa A 
or Method B is used in selecting the sample and where the number of 
items may vary from one subsample to another, as when certain serial 
numbers selected may not correspond to items in the population. For 
example, a sample of telephone customers may be selected by choosing 
telephone numbers at random, even though every number is not as- 
signed to a customer. Numbering procedures where unassigned num- 
bers are present frequently save considerable time in selecting samples. 
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In many sampling surveys, the primary purpose is to compute a 
confidence interval for the mean of the population sampled. In nearly 
every survey where scientific sampling is employed, the confidence 
interval is iraportant. The ¢-distribution is used to compute this interval 
for a sample selected by either Method A or Method B, the related 
number of degrees of freedom being determined as outlined in section 7. 
When the sample size exceeds 100, however, it is usually desirable te 
increase the number of degrees of freedom beyond 25, and thus increase 
the precision of the variance estimate, by first computing the variance 
of each one of several strata or component parts of the population on 
the basis of subsample means, and then combining these estimates in 
the usual way. The confidence interval can then be closely approxi- 
mated in most cases by employing the cumulative normal frequency 
distribution. 

The principal advantages of the procedures discussed here may be 
summarized as follows: 

1. For samples consisting of several hundred or several thousand 
items, a confidence interval for the sample mean can be computed at 
considerable saving in time and effort with a negligible loss in effi- 
ciency. 

2. When the sample mean is to be used in a complicated formula 
involving random variables, a confidence interval for the end result 
can be found rather simply after inserting the subsample means in 
the formula, one at a time, and then computing the variance of the 
several results. : 

3. The subsample means convey information to both statistician and 
layman, in a simple fashion, as to the variation in the means of samples 
of the size of each subsample. This information aids in forming conclu- 
sions as to the variability and possible bias in the over-all sample. 


2. METHOD A 


Assume that the items in a population of size N are assigned seria! 
numbers 1, 2, ---, N, where 


N = mk, (2.1) 
both m and & being positive integers. Let g denote some integer such 
that 1Sg<k; and suppose g numbers a, b, ---, 7 to be selected at 


random without replacement from the numbers 1, 2, --- , k. Assume 
g subsamples of the population to be selected, the first subsample con- 
sisting of items numbered a, (a+k), (a+2k),---, (a+mk—k); the 
second subsample consisting of items numbered b, (b-+k), (b+2k),---, 
(b-++-mk—k); and so on. Let z,,), for r=a, b, - - - , 7 denote the meas- 
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urement of some property of the sample item numoered (tk-k+>*); 
and let 


1 
t=—Ditan, t=1,2,-++,m, (2.2) 
™m ¢ 


and 
1 
#=—)> 2, r=a,b,-++,j. (2.3) 
g fr 


Then # is the mean of all the sample measurements, the number of 
which is the product mg. The purpose here is to investigate the mean 
and variance of ~ in repeated samples of this kind. 

Let 2.¢,;) denote the measurement of item numbered (tkh—k-+1) in the 
population; and let 


1 
“n= > Ee t¢=1,2,---,m, (2.4) 
™m ¢ 
for i=1, 2,---, k; so that yu; is the average measurement for the m 
items in the population numbered 7, (¢+k), (¢+2k), -- +, (¢+mk—k). 


Also, let 1 denote the simple average of the measurements for all NV 
items in the entire population. Then 


1 : oe 
o=— DD tuo =— Lm (2.5) 
mk 4 t k é 


Each subsample mean, z,, is actually a random selection from a popu- 
lation consisting of k values of u;. Hence, we may write 


Ex, = p, (2.6) 
where E denotes expected value. From (2.3) and (2.6), it follows that 
Ez = pn. (2.7) 


In other words, Z is an unbiased estimate of yu. 
The variances of z, and #, by definition, are 


a, = E(x, Ez,)? ay E(z, ni )*, (2.8) 
oz? = E(% — EZ)? = E(Z — p)?. (2.9) 


Since 


XL (@ — #)* = Do 2,? — gi 


2.10 
= > (, — u)* - gf — »)?, ' 
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we find by taking expected values that 
E Sl (2, ~ #)* = g(o,? — os*). (2.11) 


r 


But it is known [5, p. 102, equation 13’] that 
os" 2 eS es (2.12) 


since ~ is the mean of a sample of size g selected at random without 
replacement from a population of size k and variance o,*. Combining 
(2.11) and (2.12), we obtain 


N 
gg - 1) 





2 
a = 


E > (x, — #)? (2.13) 


where 





= = “ . (2.14) 


An unbiased estimate of the variance of Z can therefore be computed 
for g = 2, from the formula 
r 


2 = - — #)2. 2.15) 
g(g — 1) ~ i 





3. METHOD B 


Assume that the items in a population of size N are assigned serial 
numbers 1, 2, -- - , N, where 


N = mk, (3.1) 


both m and k being positive integers. Let g denote some integer such 
that 1Sg<k; and suppose a, hi, - - - , j: to be a set of g numbers se- 
lected at random without replacement from the numbers 1, 2, - - - , k, 
arranged in the order selected; and suppose de, be, - - - , je to be a sec- 
ond set of g numbers selected at random without replacement from the 
numbers (k+1), (k+2),---+-, (2k), arranged in the order selected; 
and so on, there being m such sets altogether. Assume g subsamples of 
the population to be selected, the first subsample consisting of the m 
items corresponding to the numbers ay, de, - - - , dm; the second sub- 
sample consisting of the m items corresponding to the numbers },, 
be, - + +, bm; and so on, the gth subsample consisting of the m items 
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corresponding to the numbers ji, jo, «+ * , jm. Let X¢,»), forr=a,b, +++, 
j, denote the measurement of some property of sample item number 
r,; and let 


] 
ee > Zitr)y t= 1, 2, Tt yo (3.2) 
m ¢- 


and 


% =- 


iS f= S, oe (3.3) 


i ee 


The purpose here is to investigate the mean and variance of Z in re~ 
peated samples of the kind just described. 


Let 
1 A 
y4=— Zz Xit.r)s r=za,b,---,J, (3.4) 
9 ¢ 
for t=1, 2,---+, m; so that z, is the average measurement for the g 
sample items with numbers a, b;, - - - , J. Let tu, denote the meas- 


urement for item number (tk—k-+7) in the population; and let 


1 
m= = Lita, {=1,2,:--,k, (3.5) 


' 


for t=1, 2,---, m; so that », is the average measurement for the k 
items in the population with consecutive numbers (tk—k-+-1) to (tk), 
inclusive. Also, let u denote the simple average of the measurements 
for all N items in the entire population. Then 


1 ‘ 1 I 
se -— es p be 2,4) 2 — > » is Zu,t) = — > us; 
mk ‘ t mk t ‘ m™ + 
and from (3.3) and (3.4), 


1 1 1 
& ane p 3 2} a eo >> > Leer) = — iB Tp. (3.7) 
t mg «+ rf ™ ¢ 


mg + 


(3.6) 


Since each sample measurement, z,:,-) is actually a random selection 
from the k measurements for items number (tk—k+1) to (tk), inclu- 
sive, for which the mean value is u,, we may write 


Exes) = wy (3.8) 








INVESTIGATING THE PROPERTIES OF A SAMPLE MEAN 61 
where £ denotes expected value. From (3.4) and (3.8), 
Ex, = mu. (3.9) 


From (3.2) and (3.8), together with (3.6), 
1 
Ex, = — D0 (Eten) = 4; (3.10) 
m ¢ 


while from (3.3) and (3.10), 
Ez=p. (3.11) 


Hence, Z is an unbiased estimate of yu. 
Let the sampling variances of 2(4,-), %:, Z,, and & be denoted by the 
symbols o?,;,,), 0%, o,%, and o,”, respectively. Then by definition, 


Pur) = E(tee) — Exes)? = E(x.2) — K4)?, (3.12) 


o? = E(x, = Ex,)? /- E(x: a Ms)”, (3.18) 
o,? = E(x, — Ez,)? = E(z, — yp)’, (3.14) 
ox? = E(f — EZ)? = E(Z — p)?. (3.15) 


Since z, is the mean of g measurements selected at random from k 
measurements with variance o%,,, the variance of z, is [5, p. 102, 


equation 137] 
k = v 
a? = (- ane ¥ (3.16) 
k-—1 g 


Also, since %q,»), Ze.r, °**, Lom) are statistically independent, the 
variance of their mean for any particular r is 





1 
o,? = —— z 7 8.) t= 1, 2, MRT Soc (3.17) 
m? t 


Similarly, since the z,’s are statistically independent, the variance of 
their mean is 


1 1 /k- 
oz” Se e- > o? = = )— > o? (t,r)> (3.18) 


k-1 : 
Combining (3.17) and (3.18), we get 


: -(—* (3.19) 
Cz ae ’ it 





g 
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which is identical te (2.12). Equations identical to (2.10) and (2.11) 
can also be derived in the same manner as in section 2, leading to a like 
conclusion. In other words, the variance of 7 may be written as 








o;? = ~E > (2, — 2)’, (3.20) 
gg-1) 
where 
k-—-q N-n 
i= = —) (3.21) 
k N 


whence an unbiased estimate of the variance of @ can be computed 
for g22, from the formula 
} pho 
= ———— } (a, — #)*. (3.22) 
gg9-—1) 


4. RELATIVE ADVANTAGES OF METHODS A AND B 


6: 


Cochran [2] has shown that for certain types of autocorrelated popu- 
lations, where the correlogram is everywhere concave upward, the 
variance of the sample mean is less for a systematic sample than for a 
random sample or a stratified sample. If there are periodic variations in 
the population, the variance of the sample mean may be more or less for 
a systematic sample than for a random sample, depending on the 
relationship between the period and the counting interval. For a good 
discussion of systematic sampling, see reference [3], chapter 8. 

The points just made with respect to the relative advantages of 
systematic sampling and random sampling also apply to the relative 
advantages of Method A and Method B as described in the preceding 
section. In other words, Method A is better than Method B with re- 
spect to the variance of the sample mean if the correlogram is every- 
where concave upwards. If there are periodicities, the better method 
depends on the relationship between the period or periods and the 
counting interval, k. Either Method A or Method B is better than 
systematic sampling with a single random start, however, in this im- 
portant respect: the variance of the sample mean can be estimated from 
the sample itself without making assumptions about the distribution 
of the population; and if formulas (1.2) and (1.3) are employed, the 
estimate will be unbiased. Moreover, even if the correlogram happens 
to be concave upward, the gain in precision by using systematic 
sampling with either single or multiple random starts will be small 
unless (a) the correlation between adjacent items in the population 
is large, and (b) the sampling fraction n/N is also large. 
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In designing a sampling study with the objective of computing a 
confidence interval, some attention must be given to the precision of 
estimates of the variance of the mean, as well as to the precision of 
estimates of the mean itself. and to the related number of degrees of 
freedom. In situations where Methods A and B lead to sample means 
with about the same precision, and formula (1.3) leads to estimates 
of the variance with the same precision and degrees of freedom, the 
confidence intervals will necessarily tend to be about the same. For 
Method B, however, it is possible to regard the procedure as a kind 
of stratified sampling and to compute a more precise estimate of the 
variance of the sample mean from the usual formula 


r 1 
by > (Xctr, — F:)*. (4.1) 


2? = — — 
m? gg—1) a 7 





The gain in precision by employing this formula with Method B may 
lead to a confidence interval as rnuch as 20 per cent smaller than the 
corresponding interval for Method A with 10 subsamples where the 
computation includes the use of formula (1.3). This gain in precision 
may more than offset the advantages of Method A, even where the 
latter produces an estimate of with a somewhat smaller variance. 
In fairly small samples, therefore, it may be desirable to design a 
sample with g, the number of random selections per counting interval, 
as small as 2. On the other hand, Method B may require somewhat 
more time in selecting the sample serial numbers, and formula (4.1) 
will usually require considerably more computation than formula (1.3). 

In large samples, stratification should generally be employed. Sup- 
pose, for example, that Z; is just one of a number of means of independ- 
ent samples of different strata of the population, and that we ar2 
interested only in a confidence interval associated with a weighted or 
unweighted average of these sample means, 


Le wye; 

er ff 1,822, F, 4.2) 
i Wj 
j 

where w; is a predetermined weight, or constant, associated with Z,, 

and T is the number of strata. Then an estimate of the variance of 7, 

computed from the formula 


637 = :e 


j 





ry) 


2 


{ ww; 
o2;", (4.3) 





W; 








»» 


| 
\ 
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may be so precise, even if the variance of each #; is computed from 
(1.3), that there is little to be gained by substituting (4.1) for this 
purpose. The related number of degrees of freedom can be computed 
from (7.12) and (7.13) in section 7. 

Practical considerations must also be taken into account in the choice 
of sampling method. Thus, in an actual problem where it was desired 
to select a five-per-cent sample of serially numbered telephone service 
orders, a very simple method had to be devised to let the office forces 
know in advance which orders would appear in the sample. The most 
practical procedure appeared to be to select five 2-digit numbers at 
random for each of the several administrative districts, and designate 
each service order with a serial number ending in one of these 2-digit 
numbers as a sample order. The sample was therefore selected in ac- 
cordance with Method A, each 2-digit number designating a particular 
one of five subsamples in each district. In this instance, the choice 
between Method A and Method B was based on practical considera- 
tions, since there was no reason to suppose a relationship between the 
serial numbers and the sample measurements. 

On the other hand, in employing Method A to select a sample of a 
population with high autocorrelation between adjacent items, it actu- 
ally happened that there were 6 consecutive sample numbers in each 
slice of 30 serial numbers. The statistician was then faced with a 
dilemma. If he used the sample, he could compute confidence limits 
and state that the population mean lay within those limits with the 
assurance that if repeated samples were selected and analyzed the 
same way and a similar statement were made about the population 
mean on the basis of each analysis, the statements on the whole would 
be correct a high proportion of the time. But at the same time, it would 
be apparent to almost anyone who looked at the serial numbers se- 
lected that the particular sample in this instance was quite likely to 
be one of the exceptional cases where the statement would not be true. 
On the other hand, if the sample were discarded and a new sample were 
selected, the computation of a confidence interval would involve the 
difficulty that we do not know the risks involved in a procedure © 
whereby we keep only samples that we like. A dilemma of this kind is 
much less likely to arise under Method B than under Method A. 

While rules to cover every possible contingency are impossible to 
formulate, the following recommendations may serve as a general 
guide in most situations where the sole or primary purpose is to com- 
pute a confidence interval for the mean of a population: 

1. If possible, design the sample so that there will be from 25 to 50 
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degrees of freedom associated with the estimate of the variance of the 
sample mean. If there are less than 25 degrees of freedom, the pre- 
cision of the variance estimate wili be adversely affected and the confi- 
dence interval for the population mean will therefore tend to be wider 
than necessary. On the other hand, exceeding 50 degrees of freedom 
will tend to reduce the width of the confidence interval to a negligible 
degree, and require more computations than the improvement in the 
results is worth. 

2. For samples of size less than 50, employ simple random sampling 
without replacement, so that the probability of selecting an item not 
already selected is the same for every item in the population. Also, in 
cluster sampling where fewer than 50 clusters are to be included in 
the sample, select individual clusters by simple random sampling. 
Again, in sampling attributes where the expected number of sample 
items possessing (or not possessing) some important attribute is less 
than 35, employ simple random sampling. Compute confidence inter- 
vals in the usual manner. 

3. Except where simple random sampling is indicated, arrange the 
population by strata, if practicable, so that items within the same 
stratum tend to be more nearly alike than items in different strata. 
Also, assign serial numbers, if practicable, so that items adjacent to 
one another tend to be more highly correlated than items farther apart. 
In many situations, however, the only practical arrangement of the 
population is the one that already exists. 

4. If the sample size is from. 50 to 100, and the population is so 
arranged that unlike items tend to be farther apart, divide the popula- 
tion into half as many slices as the total number of items to be selected, 
and then select two items at random from each slice. The variance of 
the sample mean should be estimated from (4.1). Use the cumulative 
normal frequency distribution to approximate a confidence interval 
for the population mean. 

5. For samples not covered by the foregoing rules, divide the popu- 
lation into strata, with unlike items in different strata, if practicable, 
and then select several random subsamples from each stratum. In 
most cases, select these subsamples by Method B. Compute the 
variance of the mean of each stratum from (1.3); and the variance of 
the over-all mean from (4.3). Use the cumulative normal frequency 
distribution to compute a confidence interval for the population mean 
if the number of degrees of freedom exceeds 25; otherwise, use the ¢- 
distribution. If in doubt as to the number of degrees of freedom, use 
the formulas in section 7. 
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6. The foregoing recommendations to use the cumulative normal 
frequency distribution are predicated on the supposition that the 
available evidence as to the distribution of the population and the dis- 
tribution of the sample items support the assumption that the over-all 
sample means in repeated samples would be almost normally dis- 
tributed. In doubtful cases, investigate the higher moments of the sub- 
sample means, particularly the third and fourth moments. 

7. In designing the sample, take advantage of devices such as strati- 
fication and disproportionate sampling if they will materially improve 
the efficiency of the sample without increasing the expense. Where 
some of the population items are extremely large or small, select a 
high proportion or even all of such items to increase the precision of the 
results and to minimize the departure from normality in the distribu- 
tion of estimates of the population mean. A very complex sample de- 
sign is seldom necessary or desirable, however. 


5. STRATIFIED SAMPLING 

When stratified sampling is emploved, the primary purpose is usually 

to estimate a weighted or unweighted population mean of measure- 

ments of items in several classes or strata, this mean being of the 
general form 


Lh, Wee 
I ‘ 


Ar 
j 


= > diac date (5.1) 


where 7' is the number of strata, N; is the size of the jth stratum, 2,; 
is the measurement of some property of the ith item in the jth stratum, 
and W; is a weight applied to each measurement for that stratum. In 
a typical situation, W; is the average cost or value of the items in the 
jth stratum. An unweighted mean may be regarded as a special case 
where the W,,’s are equal for the entire population. 

Equation (5.1) may also be written as 


Dd wyu; 
j 


where 
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and y; is the mean of the jth stratum; that is, 
> Viz ; 
=— 5.4 
Bj N, (5.4) 
It follows that an unbiased estimate of u can be computed from the 


formula 


p 3 wa; 
= =-— (5.5) 
W; 
j 


where #; is an unbiased estimate of u;, computed from a sample selected 
by either Method A or Method B, and w; is a predetermined weight 
that is defined by (5.3). If the samples of different strata are selected 
by using different sets of random numbers, so that the values of 2; 
are statistically independent, the variance of % is given by the equation 


¢;?, (5.6) 


where ¢;? is the variance of Z;. It can be estimated from (4.3) if esti- 
mates of the variance of Z, are computed as suggested in sections 2 
and 3. 

Where there are considerable differences between strata in the vari- 
ance of the measurements, or in the cost per measurement, it may be 
desirable to employ different sampling ratios in different strata. The 
usual procedure for varying these ratios is to vary the counting interval, 
k, rather than the number of subsamples, g. The effect of this variation 
on ¢;*? cannot be determined in advance without some knowledge of 
the relationships between the serial numbers and the variances and 
covariances of the measurements. For this reason, there is some diffi- 
culty in determining the optimum allocation of the sample sizes among 
the various strata. The discussion that follows rests on the assumption 
that each stratum is perfectly homogeneous, so that the variances 
and covariances cf the measurements are independent of the serial 
numbers. Unless this assumption is reasonably realistic, the formulas 
derived therefrom should be used with caution. 

Under the assumption just stated, the variance of #; for a sample of 
the jth stratum selected by either Method A or Method B is given 
by (3.18), which may be rewritten as 
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1 /k; 
sel aioe » > @;”, ¢=1,2,+--, mj, (5.7) 
k;-—1 P 
where ¢;? denotes the variance of Z;, k; is the counting interval, m, 
is the number of intervals, g is the number of selections per interval, 
and ¢;’ is the variance of z,;, which is now assumed to be the same for 
every 7. By mal:ing the further substitutions, 


k; = N;/m,;, (5.8) 

dX «7 = mo;’, (5.9) 
‘ 

m;=n;/q, (5.10) 


where n; is the number of sample measurements for the jth stratum, 
we can write 


: N; — n; 
52 = £ (2) o;?. (5.11) 


Combining (5.6) and (5.11) leads to 


V= >) 
7 





w; |g s(t =) . 

ae ce ar ig oj; 

2 wy; | ny \gN; — 5 (5.12) 
j ) 


which shows the relationship between the variance of 7 and the sample 
sizes. Suppose the cost of obtaining the sample measurements for the 
jth stratum is proportional to n;, and that the total cost of the sample 
survey is of the form 


=at >) ng;, (5.13) 
j 


where c; is the cost per measurement for the jth stratum, and a repre- 
sents an overhead cost that remains the same regardless of the sample 
sizes. Then the stratum sample sizes that minimize the variance of z 
for a given cost, must satisfy (5.13) and the equations obtained by 
setting the partial derivative of the expression 


x v (Si) <2 + L(a + 3 tabtyine c) (5.24) 


Wj; 


LD vj 
i 


n;\gN; — 1 








with respect to each n; equal to zero. These latter equations may all 
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be written in the form 


» 22 1 —1 
- ( : ) j@=1,2,---,T. (5.15) 
( 2, wy)%e; 


n;? (gN; —1n;)?/’ * 


On the other hand, the stratum sample sizes that minimize the cost 
of the survey when the variance of Z is specified to be some quantity, 
V, must simultaneously satisfy (5.12) and 7 equations equivalent to 
(5.15). In any case, for the optimum allocation of the numbers of 
sample measurements among the various strata under the assump- 
tions in this section, the n,’s are such that the right-hand side of (5.15) 
is the same for every j. 

If N,/n; is large for each stratum, the expression inside the brackets 
in (5.15) is approximately equal to 1/n,*, and the samples of the vari- 
ous strata should therefore be approximately proportional to wjo;/+/c;. 
Closer approximations to the optimum sizes of the strata samples can 
theoretically be obtained by iterative procedures; but the improvement 
in the results may be illusory, or so slight as not to be worth the effort. 

The discussion of weighted averages in this section and the sections 
which follow applies only to situations where the weights are fixed 
constants, and not random variables. It does not apply, for example, to 
situations where the weights are computed from sample data. 


6. WEIGHTED AVERAGES OF RATIO-ESTIMATES 


To illustrate the type of problem discussed in this section, suppose 
that in sampling the per cent condition of certain classes of telephone 
property, poles are selected as sample locations not only for inspecting 
the poles themselves, but 9!so for inspecting all attachments as well, 
including aerial cable, aerial wire, and drop wires running from poles - 
to customers’ premises. Suppose that these sample locations are selected 
according to Method A or Method B as described in sections 2 and 3; 
and suppose that the average per cent condition is computed separately 
for each subclass, such as aerial cable, aerial wire, and drop wires, 
and that these averages are then combined, after weighting each aver- 
age in proportion to the value of the related property, in order to arrive 
at un over-all average per cent condition for all aerial plant—thet is, 
poles and attachments. Then, since the number of units of aerial 
cable, aerial wire, and drop wires varies from pole to pole, the total 
number of sample units inspected for each of these subclasses is obvi- 
ously a random variable. Moreover, since a properly designed sample 
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of the type described here must include poles not owned by the tele- 
phone company but carrying company-owned attachments, the nuin- 
ber of company-owned poles in the sample is also a random variable; 
and even if this were not the case, it could be regarded as a random 
variable with zero variance. In general, the estimated average per cent 
condition of each subclass of property mentioned here is the ratio of 
two random variables, one of which is the sum of the percentages repre- 
senting the condition of the sample units of property, and the other is 
the number of units included in the sample. The estimate of the over- 
all per cent condition of all aerial plant is a weighted average of several 
of these ratio-estimates. 

The problem arises as to how to compute the variance and possible 
bias of an over-all average of the kind just described, where the com- 
ponent: ratios are not statistically independent. The approach taken 
here is similar to that ordinarily used in approximating the variance 
and bias of a simple ratio-estimate; that is, the estimating function is 
expanded in a Taylor series in powers of differences between the ran- 
dom variables and their expected values, with powers beyond the sec- 
ond degree neglected. 

For the rth one of g subsamples selected by either Method A or 
Method B, let x, denote the sum of the measurements for the sample 
items in the tth one of T categories of items, and n,, denote the related 
number of sample items. Let 


1 


ae = — Do oer, r= 1,2,---,g,;t@=1,2,---, T, (6.1) 
a - 
1 

ne = — Do ne, (6.2) 
a= 

R, — > WL 1r/ Nery (6.3) 
t 

R. = 2) war,/n;, (6.4) 
t 


and 


¥ Bend 
R, = — > R,, (6.5) 
g rf 


where wu, is a predetermined weight (or constant) associated with the 
ith category. Also, let p denote the value of R, (or R,.) that would result 
if this statistic were computed on the basis of a survey covering every 
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item in the population. Assume that R, and R, can each be approxi- 
mated satisfactorily by a Taylor series in powers of 24, and m;,, meas- 
ured from their expected values, with terms beyond the second degree 
neglected. Then it can be shown that R,, has less bias than R, as an esti- 
mate of p; that the bias in R,, can be estimated from the formula 


pA ieee ay | YS) (6.6). 
g-1 


and that the variance of R, or of (Re—B,), as well as the variance of 
R,, can be estimated from the formula 


x 
R, — R,)?; 6.7 
ene ) (6.7) 


vos 





where B,, is the estimated bias in R,, V is the estimated variance of 
R,, R., and (Ry—B,), and d is the finite multiplier defined by (1.5). 
The derivation follows. 

From (6.1) and (6.2), 


Ex, = Ezy, (6.8) 

En, = Eny; (6.9) 
and it can easily be shown that 

p= > ot) (6.10) 

t 

where 

wx; WL Ley 

o(t) = = . (6.11) 





En: Enyw 


Let ¢ and t’ denote different categories of items. Then the first and 
second partial derivatives of the right-hand member of (6.3) with re- 
spect to each z,, and n;, are functions of the following forms: 











oR, W Ler 

=_— — (6.12) 
ON Nir” 
OR, We 

a. 3 (6.13) 
OX tr Ney 
eR, Q2WiL er 
- = ——) (6.14) 








ON ey* Ni? 
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o°R, Wt 
———- = - (6.15) 
OX ON ey Ney? 
oR, eR, o"R, &R, 
bdling 29 0. (6.16) 








Oty? OBeBay,. Onednr, Ozedner 


Let on(t), b(t), dan(t), and @en(é) denote the values of the right-hand 
side of (6.12), (6.13), (6.14), and (6.15), respectively, when 2;,= E2;, 
and n=En,,. Then under the assumption that R, is approximately 
equal to its expansion in powers of (z,,--Ez,,) and (ni—Enz,), with 
terms beyond the second degree neglected, we may write 


R, = S(t) + Do (ee — Exw)oe(t) + XS (me — Ener)ba(t) 
1 
+ ={ ae (Ne 5 Enir)*Onalt) (6. 17) 
+2 + (Ltr — Exy) (Ne nie Enw)beu(0h . 


This relationship may be reduced to the simpler form, 


R, = 3 (1+ Xe — Ne t+ Nu? — XeN uw) O(0, (6.18) 
t 








where 
te — Exe 
xX, = ———=; (6.19) 
Exy 
Nir — Ene 
Raine cnet ° (6.20) 
Eny, 


Comparing (6.3) and (6.4), we see that R, and R, are identical func- 
tions of (x;-, ni) and (24, m,), respectively. Hence, if we let 





z= Ez, 
X,=- , 6.21 
‘ Ez, ( ) 
and 
— En 
Mi, o's (6.22) 
En: 


then under the assumption that R,, is approximately equal to its ex- 
pansion in powers of (z,—Ez,) and (n;—En;,), we may immediately 
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write 


R, = 1+ X:—- Ne + NA — XN OW. (6.23) 
t 
Moreover, by combining (6.5) and (6.17) to (6.22), we have 
<a 1 
R, = % [1 + Xe — Net Li Wot — XuNw) 000. (6.24) 
t r 


Thus, both R,, and R, are now expressed essentially in terms of first 
and second degree terms in (2%—Ez.,) and (n—En,,). The same 
approximate relationships can also be derived by expanding R, and R, 
directly in these terms; but the derivation given here is much simpler. 

To investigate the bias in R,, and R,, we first take expected values 
of both sides of (6.23) and (6.24). From (6.8) and (6.9) and (6.18) to 
(6.22), 


EX, = EXe = EN, = ENw = 0, (6.25) 
so that 


(1+ ENy,? — EXeNw)o(d). (6.27) 


ER, = >> (1+ EN2 — EX.NDo()), (6.26) 
i : t 
ER, = >> 
t 


Combining these last two relationships with (6.10), we can express the 
approximate bias in R,, and R, as 


E(R. — p) = >, (EN2 — EX.N) 41), (6.28) 
t 

E(R, —p)* pas (EN ;,? — EXuNu)o(b). (6.29) 
t 


In view of (6.25), expected values of the form£X?, EN 2, E(X.—N,)’, 
EX,,?, EN,,?, and E(X1-— Nz)? are necessarily identical with the vari- 
ances of X,, Nz, (X:—N.), Xw, Nur, and (Xy—N;,), respectively. A 
direct result of this fact and (2.12) and (3.19) is that when the sample 
locations are selected by either Method A or Method B, 

2 2 di 2 ee 
EX; rf EN, a E(X; SY _* iis (6.30) 
EX? ENiz2 E(Xw—Nuw)? k—-1 g 








whence 
EX.N, k-—-gl 
EXyNye k-1 g 
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Combining (6.28) to (6.31), we obtain 
E(R.—p) k-g 1 
ER, —p) k-1 9° 





(6.32) 


which implies that 
k-g 
E(R. — p) * —— _ R.), (6.33) 
kK g- 
and indicates that, under the assumed conditions, 
E(Re — p) < E(R, — p), (6.34) 


for g=2. In other words, R, has less bias than R,. Formula (6.6) is 
obtained by substituting the actual value of (R,—R,,) for its expected 
value on the right-hand side of (6.33). 

To approximate the variance of R,, we square (6.23) and neglect 
terms in X, and N,; beyond the second degree obtaining 


Rt = | > oo] + | > x0 | + | x va] 
x0 | - 2 u 0 |] p v0 | 


‘(0 | 


N 
xt | 


+2] Deo 


(6.35) 


[= 
+2] Doe] [ 2 
Ik 





p> 
- 2] x oo || x 
~2[ Exe] Evo], 


When we take expected values of both sides of (6.35), certain terms 
vanish, leaving 


oR. = | x co] +E > xo | + Bl x veo] 
+2| Deo|z x nro(o 
-2| x oo | LX xt | 


- &£ 
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_ 2B | u x() | [ u vet |. 


Squaring (6.26) and neglecting moments of X, and N;, beyond the 
second order leads to 


(ER.)* * | x oo] + 2 x o(0)| | x veo(o | 
_ 2 uu o(0| | u xWeoth |. 


(6.37) 


Subtracting (6.37) from (6.36), we find the variance of R, to be given 
approximately by the relationship 


BR. ~ BR.) +E] xa] + | r ve | 


- 28 | re xo || u vt | (6.38) 
- | Dx - Nooo]. 


The variance of R, can be approximated in a similar manner. Squar- 
ing (6.24), and neglecting terms in X,, N., X:-, and N,, beyond the 
second degree, we obtain 


m+ [ Foo] +[ Exe] +[ Eran] 
+2[ Eee || Exe |-2[ Foo |[ E40 
+=| Doo|[ Cov] (6.39) 
-=|% p » oo || paps XN uo | 
-o[ eae] 20] 


When we take expected values of both sides of (6.39), certain terns 
vanish, leaving 
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ER,’ = | u co] + El Xu xo | + | u veo] 
+2[ D0] B| Dvr000 | xs 


_ 2| > o(0 | | Ris XN (0 


Be 28 | p 3 x(o || i Or 


Squaring (6.27), and neglecting moments of Ni, and X;, beyond the 
second order leads to 


(ER)? + | SY 6 | oa 2| .¥ o(0 | | >, vate) | 
ye , i (6.41) 
- 2) > | | oe XN wed |. 


Subtracting (6.41) from (6.40), we find the variance of R, to be given 
approximately by the relationship 


2 2 
E(R, — ER,)? = Bl 2 i x60 | ao al » vd | 
t . t 


(6.42) 
-—2E | De xo || 3. ve | 
t t 
2 
= | D(X: - vot | . 
¢ 
Suppose we let 
R. = R. — Bu, (6.43) 


where B, is defined by equation (6.6). Then from (6.6) and (6.43) 
R, = (1 + a)R, — aR,, (6.44) 


where 


a = X/(g — 1). (6.45) 
From (6.23), (6.24), and (6.44) 
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R. = zl + X,-N,+ (1 +a)(N2 — XN) 


t 


2p wy Xn) | 40. 
g fr 


(6.46) 


By subtracting the square of the expected value of (6.46) from the 
expected value of its square, and neglecting moments and product- 
moments of X,, N:, Xu, and Ny, beyond the second order, we obtain 


2 
E(R. — ER,)* = [ LX (xX. — xi«0 | . (6.47) 


Comparing (6.47) with (6.38) and (6.42), we see that 
E(R, — ER.)? = E(R. — ERy)? = E(R, — FR,)*?. (6.48) 


In other words, R,, R,,, and R, have approximately the same variance. 
But from equations (2.15) and (3.22), we know that when subsamples 
are selected by either Method A or Method B, an unbiased estimate of 
the variance of R, can be computed from the formula 
r 
V = —_——- ) (R, — R,)’, (6.49) 
g (g ree 1) ° 


where V is the estimated variance of R,, and 
ee 
jy eee: (6.50) 
q 


Formula (6.7) follows from this fact and from (6.48). 

The derivation of (6.6) and (6.7) merely substantiates procedures 
suggested verbally by J. W. Tukey. Variations of these procedures are 
sometimes employed. Thus, in one application of (6.6), Tukey em- 
ployed a graphic procedure in which the first step was to compute R 
successively on the basis of (1) the average of ten subsample estimates 
of p, each computed from the formula for R,, (2) the average of five 
subsample estimates, each such subsample being formed by combining 
two of the original ten subsamples, (3) the average of two subsample 
estimates, each such subsample being formed by combining five of the 
original ten subsamples, and (4) the estimate obtained by combining 
the data for all the subsamples and employing the formula for R,. 
The second step was to plot each average on coordinate paper, taking 
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the estimate as the second coordinate, and the number of subsample 
estimates used to compute the average as the first coordinate. Thus, 
the four averages just described were plotted at distances to the right 
of the vertical axis equal to 10 units, 5 units, 2 units, and 1 unit, re- 
spectively, the length of a unit being immaterial. The third step con- 
sisted in projecting the trend of the plotted points to estimate its inter- 
cept on the y-axis. This intercept was taken as an estimate of p, 
corrected for bias; and the difference between each average and the 
estimate of p was taken as the estimated bias in that average. The 
method is based on the assumption that the bias is inversely propor- 
tional to the sample size, this assumption being roughly in accordance 
with (6.33), which implies that all four points plotted as suggested 
here will tend to lie on a straight line. If a straight line be drawn through 
the points (1, R,,) and (g, R,), the intercept on the vertical axis will be 
near the point (0, R.) where R, is computed from (6.43) and (6.6). 
Tukey’s graphic procedure should also be useful in situations where 
the relationship between the bias and the reciprocal of the sample size 
is not approximately linear, and it is therefore better to fit a curve to 
the plotted points. 

Occasionally it may happen that a part’ ar subsample is empty; 
that is, n;- in equation (6.3) may be equ: 10 zero for some particular 
r, for the reason that the sample locations for the rth subsample do not 
include items in any of the categories of the population in which we 
are interested. In that case, the value of g in the formulas and equa- 
tions given here should be understood to mean the number of sub- 
samples that are not empty. The situation is theoretically identical 
with that in which the sample locations are such that empty sub- 
samples are impossible, but the number of subsamples employed is 
determined by a random selection from some universe of positive 
integers. 

When empty subsamples are at all likely, it may be desirable to 
specify in advance that only samples with a minimum number of 
occupied (nonempty) subsamples will be admissible, and that other 
samples ‘vill be discarded. Each estimate of the variance of R,, as 
computed from formula (6.7), will then be unbiased in each of two 
senses: 


1. It will be unbiased as an estimate of the variance of R, for the 
class of samples in which the number of occupied samples is the 


same as the one at hand. 
2. It will be unbiased as an estimate of the variance of R, for the 


class of al] admissible samples. 
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This property of the estimates is illustrated by comparison of the last 

two columns in Table 1, which is based on a simple example where 
three subsamples are selected from a very large population so that 
subsampie means of 0 and 1 are equally likely, but where there is a prob- 
ability of 1/3 that a particular subsample will be empty. The entries 
in the last line of the table cover the situation where admissible samples 
are defined to be those with 2 or 3 occupied subsamples. 

From the foregoing, it follows that under the assumptions previ- 
ously stated, formula (6.6) leads to estimates of the bias in R,, that 
are approximately unbiased in each of the two senses mentioned, and 
tormula (6.7) leads to estimates of the variance of R, and R, that are 
approximately unbiased in each of those two senses. 


7. CONFIDENCE INTERVALS AND DEGREES OF FREEDOM 


For each method of selecting a sample that was discussed in the pre- 
ceding sections, a formula for computing the over-all sample mean was 
given that is equivalent in every case to 


== >) wm, j=1,2,--- (7.1) 
j 


where z; is the average measurement for the jth one of 7’ component, 
parts of the entire sample, and w; is a weight associated with 2; that 
may or may not be equal to every other weight. A formula was also 
given for estimating o,°, the variance of #. Under certain conditions to 
be presently discussed, we can use the results of the computations indi- 
cated by these formulas to determine a confidence interval for yu, the 
over-all mean of the population. This interval lies between the statis- 
tics i, and jig, computed from the equations 


= = — léz, (7.2) 
Bo =f + léz, (7.3) 


where ji, and jy are the lower and upper confidence limits, respectively, 
and ¢ is the value shown in a table of the ¢-distribution (for example, 
Table III in reference [6]) that corresponds with (a) the desired degree 
of confidence that u actually lies between Zi; and zy, and (b) the num- 
ber of degrees of freedom associated with 6,. 

The degree of confidence, usually expressed as a percentage, is equal 
to one minus the probability shown in a table of the ¢-distribution for 
the number of degrees of freedom associated with @,. For example, 


suppose 
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1 
;= ry = l, 2; . =? 7.4 
t=? J (7-4) 


(7.5) 


where z, is the rth one of 10 measurements selected at random from an 
infinite universe of normally distributed measurements with mean u. 
Suppose, moreover, that we wish to compute a 98 per cent confidence 
intervai for u. For the conditions to be discussed, it is known that the 
ratio 


t = (& — p)/é (7.6) 


will be distributed as Student’s ¢ with 9 degrees of freedom. Since the 
probability is .02 that the numerical value of ¢ for 9 degrees of freedom 
will exceed 2.821, we can compute confidence limits x, and fy by 
substituting 2.821 for ¢ in (7.2) and (7.3), and we can then state that u 
lies within these limits with the assurance that in repeated samples 
selected in the same manner under the same vonditions, a statement 
of that kind about confidence limits computed ihe same way for each 
sample will be correct in the long run 98 per cent of the time. 

For the more general case where @ is the over-all mean defined by 
equation (7.1), let 2; be the mean of g; sample measurements in a sam- 
ple from a component part of the population with mean y;; and let 


w= L wit, j=1,2,---,T, (7.7) 


-> (2, — 2;)*, F 1,2, °**, (7.8) 
———t -1)4 


(7.9) 


where 2;, is the rth one of g; sample measurements, with mean 2;, 
selected from the jth component part of the population. Then it is 
known that the following are sufficient conditions for the ratio defined 
by (7.6) to be distributed as Student’s ¢. 

1. The sample mean, &, is an unbiased estimate of yu. 

2. The z,’s in (7.1) are all normally and independently distributed, 
as when each is the mean of a sample selected from an infinite 
universe that is normally distributed. 

. The w;,’s in (7.1) are all equal to one another. 
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The ideal conditions just described are seldom saiisfied in practical 
probleins; but the actual conditions are frequently such that equations 
(7.2) and (7.3) can be used to approximate a confidence interval. Thus, 
# may not be strictly unbiased as an estimate of »; but in large samles 
correctly designed, the bias is usually small and even negligible. The 
distribution of the z;’s can not be exactly normal in samples from a 
finite population; but the central limit theorem (see reference [4], 213 
ff.) indicates that the distribution must tend to be normal in large 
samples from a much larger population of measurements with finite 
bounds. The situation where the w,’s are all equal to one another, and 
the g;’s are also equal to one another, usually permits us to determine 


TABLE 1 
EXAMPLE OF EFFECT OF EMPTY SUBSAMPLES 








No. of Probability Possible Conditional Expected 
Occupied of Sample Probability Values of 
Samples g Statistics of Rr Statistics 


g Py Rr R, EV 





0 1/27 


1 2/9 


2 and 3 13/120 
combined 





* Sample statistic can not be computed. 


the number of degrees of freedom exactly; but the number can be 
approximated in many other situations by employing formulas to be 
discussed later on in this section. 

For situations where the use of (2.15), (3.22), (4.1), (6.7), and (7.5) 
were suggested for estimating the variance of the over-all sample mean, 
the exact number of degrees of freedom associated with the estimate 
is of the general form T(g—1). Thus, where (2.15), (3.22), or (6.7) is 
applicable, 7’ =1, and the number of degrees of freedom is (g—1). For 
(4.1), T’=m, and the number of degrees of freedom is m(g—1). For 
(7.5), T =1 and g= 10, so the number of degrees of freedom is 9. 
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For stratified sampling, where the w,’s in (4.3) vary from stratum to 
stratum, the number of degrees of freedom associated with the esti- 
mate of the variance of ~ as defined by (4.2) or (5.5) can be approxi- 
mated. When there are just two strata, we can employ Alice A. Aspin’s 
[1] equations, which are of the form 

1 c (1 — c)? 
— = -— + —) (7.10) 
f fi fs 


w,76;" 





= -9 wah) 
W176," -f- wees" 

where w, We, 61, and é: have the same meaning as in (4.3); where f, 
and f, are the degrees of freedom associated with é;* and é,?, respec- 
tively; and where F is the approximate number of degrees of freedom 
associated with é,?. J. W. Tukey (ir telephone conversation, July. 19, 
1951) has pointed out that (7.10) and (7.11) may be written in the 
general form 


an tt . 
Pi } 


<a (7.13) 
eS (7.13) 
DL w*6;? 
for use in situations where there are more than two strata. The transi- 
tion from (7.10) and (7.11) to (7.12) and (7.13) is a straightforward 
exercise in mathematical induction, which can be briefly described. 


Let the estimated variance of the statistic z,, be equal to 


Vin = = w;76;? eT: 14) 
j=l 
for every positive integer m. For some particular integer, p, let F, 
denote the approximate number of degrees of freedom associated with 
V,, and suppose that the reciprocal of F, is 


P { w,6,? : 
Fo = > f| uP Bos} : (7.15) 
j=l > w/?6;? 


j=l ) 


where f; is the number of degrees of freedom associated with ¢;*. Then 
from (7.10), (7.11), (7.14), and (7.15), the approximate number of 
degrees of freedom associated with V,,; is the reciprocal of F,,,~ where 
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{ Pp 

DL, w;*6;? ere 

j=l ie ‘pt+l Fp+1 
Fy t grt rms ep renee 


p+l pti 


LD wie? | Dd wie? 


j=l jul 


Combining (7.15) and (7.16) leads to 


Poy =< fi p+ 
LD. wie; 


j=l 
Since (7.17) is the same as (7.15) except for the substitution of (p+1) 
for p, it follows that if an equation of this form is true for some integer 
p=, it is also true for the integer m+1, and hence for every integer 
larger than m. But an equation of this form is obviously true for the 
integer m=1. Hence, it must be true for every positive integer. Since 
(7.15) is equivalent. to the combination of (7.12) and (7.13), it follows 
that (7.12) and (7.13) are applicable to stratified samples for any 
number of strata. 
It turns out that (7.10), (7.11), (7.12), and (7.13) are identical with 
Satterthwaite’s [9] formula (7). See comments in his article, and refer- 
ences to earlier papers. 


2 


26 2 
pe Bat (7.17) 
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SOME THEORETICAL ASPECTS OF THE LOT PLOT 
SAMPLING INSPECTION PLAN* 


Lincotn E. Mosss 
Stanford University 


The Lot Plot Plan is a modified variables sampling inspec- 
tion plan which calls for taking 50 observations and which 
makes use of the sample histogram. Depending upon the »n- 
pearance of the histogram, various rules for acceptance sve 
used. It is shown that despite use of a histogram no such p!*n 
ean be distribution-free. The behavior of the plan is studied 
where the lot distribrtion is any of several “nearly normal” 
distributions including Student’s t, the symmetric Beta dis- 
tribution, the x? distribution and the mixed normal distribu- 
tion. The probability of acceptance as a function of the irac- 
tion defective depends strongly on the distributional form of 
the lot. Little protection against non-normality is obtainable 
from study of the histogram for 50 observations. 


INTRODUCTION 

NDUSTRIAL sampling plans for the most part are of one of two 

types. 

1. Attributes plans: Here a sample is taken and each item in the 
sample is adjudged defective or nondefective. (For example, is 
a rivet long enough, or will a bolt pass through this hole but not 
pass through this smaller hole?) If sufficiently few items in the 
sample are defective, the lot is accepted since the evidence in- 
dicates a small proportion of defective items in the lot. 

2. Variables plans: Here a sample is taken and each item in the sam- 
ple is measured. (The question is, “How long is this rivet?”, rather 
than “Is it long enough?”.) From the mean of the observations 
and some measure of product variability, such as the sample 
standard deviation, or range, or a known value of the true process 
standard deviation, a decision is made as to the fraction of defec- 
tive items in the lot. Making this decision involves the assumption 
that the distribution of measurements in the lot is normal. 





Variables plans ordinarily require many fewer observations than 
attributes plans because they exploit the information about the shape 
of the distribution. On the other hand if the distribution is not norma! 
this may vitiate the decisions reached from a variables plan; whereas, 





* Prepared under contract N6onr-25126 (NR-042-002) for Office of Naval Research. 
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the decisions reached from an attributes plan are in no way disturbed. 

Both types of sampling plans may be regarded as alternatives to 
100% inspection. Where it is imperative to assure a very small fraction 
defective remaining in accepted lots, 100% inspection may be used. 
However, much experience indicates that 100% insnetion is often less 
than 100% effective—that defective items, though mapected, may not 
be identified and removed. Since this sort of con ition arises largely 
from monotony and inspector fatigue, it is often fe’t that for large lots 
a good sampling plan is a better recourse than 100% inspection. In 
addition 100% inspection is ordinarily much more expensive than 
sampling inspection. 

During recent years there has been widespread use of a third sort of 
substitute for 100% inspection. This is the Lot Plot Plan, developed 
by Dorian Shainin, which is a sort of modi‘ied variables plan. It has 
a novel feature however; it is intended to have the advantages of a 
variables plan without being sensitive to non-normality. In striving 
to achieve this objective, much reliance is placed on the appearance 
of the sample histogram (the sample consists of 50 items). The Lot 
Plot Plan is specifically proposed as a method for assuring smaller 
fraction defective in accepted lots than can be achieved by 100% in- 
spection—or even multiple 100% inspection [9]. 

It is the purpose of this paper to consider theoretical aspects of the 
Lot Plot procedure. The reader is also referred to a paper by C., C. 
Craig [3], and toa reply by Shainin [10], with rejoinder by Craig [4]. 


BRIEF DESCRIPTION OF PLAN 


The inspector draws a random sample of 50 items from the lot. He 
records the measurements in sets of five, at the same time plotting them 
on a conveniently arranged specially prepared form. When all 50 have 
been measured and recorded he has a histogram of from 7 to 16 inter- 
vals' (which facilitates quick calculation of ) and the data themselves 
recorded in consecutive sets of five (which facilitates the calculation 
of #;—the average of the ranges of the 10 subsamples). At this stage 
the inspector has three things: 

1. The histogram (the Lot Plot) on which are shown the specification 

limits. 

2. , the sample mean. 

3. #, the average rang... 





1 After the first five observations have been drawn the inspector applies a rule of thumb given him 
in order to decide whether the interval of measurement is too coarse or too fine to result in such a num- 
ber of intervals for the histogram. 
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How the statistics are used in reaching a decision depends upon how 
the histogram looks. The histogram may appear to be one of several 
types listed by Shainin: 

. Normal 

. Skewed 

. Multimodal 

. One sided (for concentricity, parallelism, etc.) 
». Flat topped 

. Peaked 

. Truncated 

. With strays 


If the histogram looks normal then the “lot limits” are calculated as 
ULL = @ + 1.37 
LLL = @ — 1.37 


and the lot is accepted if both lot limits lie inside specification limits. 
If a lot limit lies beyond a specification limit, then the fraction of the 
lot lying beyond that limit is estimated (using a convenient chart) 
from the normal distribution, taking z for the mean and # for 2.326c. 
After the fraction defective is estimated and entered on the Lot Plot, 
the entire sheet is sent to a Salvage Review Board who decides dispo- 
sition.” 

If instead of looking normal the histogram looks skewed, the inspec- 
tor calculates the two lot limits differently. He considers the mode as 
the origin and computes éy the root-mean-square average of the obser- 
vations greater than the mode (including half of the frequency at the 
modal ceil) and sets the upper lot limit at mode +3év; similarly he 
constructs the lower limit at mode—3é, where é, is the root-mean- 
square average of the observations below the modal cell (including 
half of the frequency at the modal cell). If both lot limits lie within the 
specification limits the lot is accepted. Otherwise, the proportion de- 
fective at each specification is estimated as before, but taking the mode 
for the mean and the appropriate one of év or éz for the standard 
deviation 

If instead of looking normal the histogram looks multimodal, the 
inspector calculates the lot limits still differently. In this case he con- 
siders the two “outboard modes” and if there are sufficient cases be- 
yond them he calculates separately a éy at the upper one and a é, at 
the lower one and sets: 





2 It is to be observed that in some plants the sample standard deviation, s, is calculated and used 
for an estimate of the process standard deviation. In this case the lot limits are #+ 3s. [1] 
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ULL = largest mode + 3éy 
LLL = smallest mode — 3é, 


If there are only a few cases lyinr beyond either or both of the out- 
board modes, he first draws some additional observations and then 
computes the lot limits. If one of the lot limits lies beyond a specifica- 
tion limit the per cent defective there is estimated by taking the rele- 
vant one of éy and é, and the appropriate mode as the standard devi- 
ation and mean, respectively, of a normal distribution. 

The other cases listed above carry specific instructions but will not 
be sketched here because: 


1. 


2. 


They are cases which arise less frequently in practice than the 
three above [4]. 

Description of these cases is not essential to an exposition of the 
evaluative work being reported on here. 


The following remarks are in order at this point: 


If sample items lie outside the lot limits, the lot is not regarded 
as “normal”, but either as “long tailed” or as “with strays”. 

. If a histogram looks normal and lies well within the specification 
limits, the inspector may accept without calculating any statistics 


1. 


2 


at all. 


. The inspector never rejects a lot. He refers it (via the executed 


Lot Plot form) to the Salvage Review Board (often interdepart- 
mental in composition). This board weighs the evidence in the 
Lot Plot form, and economic considerations such as production 
line needs, existing inventories, disassembly costs, etc., in arriving 
at a disposition such as screen, return to vendor, accept, scrap, 
remove some good items and rework others, etc. 


. The apparent vagueness in such phrases as “looks normal” is a 


real and genuine vagueness in the plan as published. In general, 
objective criteria (except for (1) above) are not given. It is thus 
entirely possible for two inspectors to arrive at different disposi- 
tions of the same lot from the same sample. 


NATURE OF A FULL EVALUATION OF THE PLAN 


It has already been said that the plan is intended to be applicable 
regardless of the distribution of the lot. That is, for any kind of lot 
(except possibly for lots containing “strays”) the plan is intended to 
give the (essentially) same tight operating characteristic. 

In evaluating the plan it is thus necessary to consider how it works 
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where the lot may have any one of various sorts of distributions. Since 
only a sample is inspected, a lot which is actually, say, skewed will 
sometimes give a histogram which appears to be normal, sometimes 
skewed, sometimes flat-topped, sometimes bimodal, etc. Then after a 
certain type of analysis is decided upon (from looking at the histogram) 
the sample may give lot limits leading to acceptance, or it may not; 
and the probability of acceptance presumably should be different for 
the same lot depending on what form of analysis is used. 

Thus, if f(Ai, A2, « + « , Ax) is a given distribution type with parame- 
ters Ay, Ae, - - \? Ax, a full evaluation for this distribution type would 
require determining: 


P;(a| di, Xe, '* ++, Ae). The probability that the histogram would 
“look like one of type a”. 


Ly(Aa, Ae, © + *, di | @) the probability of acceptance (as a function of 
the parameters), given that the histogram looked like one of type 
a. 


It would then be possible to obtain the over-all probability of accept- 
tance for the distribution type f (as a function of the parameters) by 
the equation. 


L(M1,%2, He ae Ne = > LQ, dz, tt Se » re | a)P (a | Au, As, O) Bi ee » Ax) 


when the summation is over-all lot plot “types.” 

There would then remain the question, “Is Ly(Ax, Az, - « « , Ax) actu- 
ally only a function of p, the fraction defective?” If so, it could be 
written as L,;(p). 

If this sort of investigation were then done for many different distri- 
bution types, a very complete account of the plan’s properties under 
the circumstances for which it was designed would be obtained. 

Carrying through such a complete study is at best horrific i: princi- 
ple. In this case it is also actually impossible because of the fact that 
the plan gives no objective criteria for concluding that a Lot Plot “looks 
skew”, etc. 

Necessarily then this paper will give a more modest evaluation than 
a complete analysis would provide. 


A COUNTER EXAMPLE 


The Lot Plot Plan cannot achieve the objective of giving a tight 
O. C. for every distribution.’ In fact it is easy to show that for certain 





* Shainin’s section “How to Handle Strays” [9, p. 25] effectively acknowledges this fact; there he 
points out “1% strays will get by 61% of the time, 2% strays 37%”. 
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kinds of distributions it must behave almost exactly like an attribute 
plan calling for 50 observations and rejection if one or more defective 
items are found. Consider a lot with the following composition: it con- 
tains N items whose distribution is centered half way between the two 
specifications, the standard deviation is so small that the distance 
between the specification limits is many standard deviations (such as 
10 or 20), and its shape is as nearly normal as is possible for a discrete 
distribution; in addition it contains k “mavericks”—items which are 
outside the specification limits in any way whatever. Now consider the 
operating characteristic. Whenever the sample of 50 items includes 
none of the k mavericks, the Lot Plot will lead to acceptance. When- 
ever the sample contains one or more of the k items the Lot Plot will 
say “refer to salvage”. The plan will thus behave exactly like the attri- 
butes acceptance plan mentioned above. 

A stronger statement can be made. It is impossible to construct any 
plan, using 50 observations, which has both of the following properties: 


a. There exist lots of such high quality that the plan is nearly sure 
to accept them. 

b. The plan is materially tighter than a 50-observation attribute 
plan for every kind of lot distribution. 


A formal statement and proof is as follows: 

Any acceptance plan which requires n observations and has the 
property that for every lot size, N2n, and every e>O there exist lots 
(of “high quality”) such that the probability of acceptance, A, exceeds 
1—e, also has the property that for any given p there exist lots with 
fraction defective greater than p for which A,>A4—e where Ayg is 
the probability of acceptance using an attribute plan with sample 
size n and acceptance number zero. Proof: 

Let L be a lot of size N2n such that A,(L)>1—«. 

Consider the lot L’(k) formed by adding K defectives to L where 
k is chosen to ensure k/(N +k) >p. 

Then 


A,(L’) 2 Prob {Sample consists only of items from Z and is accepted } 
= Prob {n observations from L} A,(L) 


(;) (7) = 


SE ne ap Pimento an gt hgh) 








rede meas 





90 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


The nature of the counter example deserves some consideration from 
the practical point of view. It surely indicates that where a small 
fraction of strays constitutes a serious problem, the Lot Plot Plan 
cannot afford good protection. On the other hand it leaves open th# 
question, “How should it behave in situations where we have nor- 
mality, or where the non-normality is not a matter of strays?”. 


THE USE OF A 50 OBSERVATION HISTOGRAM TO DETECT NON-NORMALITY 


The most distinctive feature of the plan is the use of the histogram 
to warn the user when the normality assumption is invalid and to 
cause him to employ special techniques. Thus it is natural to inquire 
as to the efficacy of the plan in this respect. 

Craig [3] reports the results of some sampling experiments in which 
he forcefully raises the question, “What can one learn about the shape 
of a distribution from a histogram based on 50 observations?”. Other 
sampling experiments have been done again casting severe doubt on 
the possibility of greatly profiting from a 50-observation histogram. 
Here a more theoretical view will be taken. 

Because the Plan is very vague as to what sorts of lot plots are to 
be regarded as skew, what sorts as normal, what sorts as bimodal, etc., 
it is hardly possible to answer the question “How reliably will the 
Plan protect against non-normality?”. Instead we will ask how well 
can any procedure using 50 observations in grouped frequencies 
identify non-normality? 

Since it is the aim of the Plan to give protection against many kinds 
of departure from non-normality it is fair to focus our attention on 
“omnibus” or “shotgun” tests. We know of one such test, whose power 
we can compute, and which has the encouraging property of being a 
likelihood ratio test (for grouped frequencies against the clas: of all 
alternatives). This test is the x? test. Accordingly we computed the 
power of the x? test against various non-normal alternatives. See Figs. 
1, 2, 3, 4. This was done in the following way. The null hypothesis 
specified a unit normal distribution; the cells for the x? test were 
those given by the deciles of the unit normal (this should give a more 
sensitive 10-cell test than equal length cells according to results of 
Mann and Wald [6]). Under the null hypothesis the expected cell 
frequencies are all 5. For various competing distributions standardized 
to have mean zero and unit variance the cell expectations were evalu- 
ated. From these the power of the 10-cell x? test could be evaluated 
using methods given by Patnaik [7]. The results are shown in Table 1. 

Another approach to the problem was also taken. The same com- 
petitors and cell intervals were used but the test was allowed in each 
ease to be tailor made for maximum power against the alternative. 
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This is accomplished by using the Neyman-Pearson criterion for test- 
ing a simple hypothesis against a simple alternative. The hypothesis 
is that the chosen intervals have the probabilities {p,;} specified by the 
normal distribution; the alternative is that the chosen intervals have 
the probabilities { p,’} specified by the particular alternative distribu- 
tion. The Neyman-Pearson criterion then gives the optimum test of 
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t 


Fia. 4 


the hypothesis against the alternative; optimum in the sense that no 
more sensitive test can possibly be constructed.‘ 
Let the observed frequencies be 


Ng #=1,---10. 





* The power of this test for any given significance level would necessarily exceed that of the Lot 
Plot test of normality if the Lot Plot used the same choice of cell intervals. 
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The Neyman-Pearson test is to reject for sufficiently small values of 


Or if we define w;=log p’;/p; we reject for sufficiently large values of 


10 
zs Nw. 
t=1 


Since the n; have a symmetric multinomial distribution under the null — 
hypothesis we should expect the distribution of the weighted sum to 
be fairly well approximated by the normal distribution with 


mean = 50 >, wp; 
variance = 50 >> w2p; — 50( >> wep,)?. 
Thus the test with significance level a becomes, reject if: 





De nits >5 : wy + 2-9 4/5 > we ° $( > w)? 


when we define z"-® by: 


1 a(«) 
€= —f et itt 
V2r —« 


TABLE 1 


POWER OF x? TEST AGAINST VARIOUS NON-NORMAL 
ALTERNATIVES 








LEVEL or SIGNIFICANCE 





DISTRIBUTION 





ty 

ts 

his 

8(3, 3) 
8(10, 10) 
r(3) 
r(8) 
T'(15) 











* Mixed Normal (p; o:°, os"; w:, #s); density function is 


ee o-o4 et 4 1 — p) : —— elon V0 3, 
Ve 


oiv2e o 
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Under the alternative hypothesis, {p;’}, the statistic )-12, nw; will 
have 





mean = 50 >> wy p,’, and 
variance = 50 >> p;‘w, — 50( >> piw,)? 


again using a normal approximation, the power of the test which is: 


P} Sours he + 2-0 4/55 Ewe — 2 are ta i} 


may be approximated as: 


—50 >d wip —p)+ -o4/s/ x. w?Z — “| 
V/50 P pw? — 50( > Di ‘w,)? 


where z has a unit normal distribution. 

The power of this test against the same alternative distributions is 
shown in Table 2. Admittedly study of the values in Tables 1 and 2 
does not tell us directly how the Lot Plot “test of normality” will be- 
have for these distributions. Both of the tests considered, however, do 
enjoy certain optimum properties, and there is every reason to suppose 














Pris > 


TABLE 2 


POWER OF NEYMAN-PEARSON TEST OF NORMALITY 
AGAINST VARIOUS ALTERNATIVES 























LEVEL or SIGNIFICANCE 

DIsTRIBUTION : 
-05 25 
ts .310 .689 
ts .153 .483 
tis .098 .376 
B(3, 3) .160 .489 
B(10, 10) .075 .319 
r(3) -605 .899 
r(s) .312 .685 
T(i5) -216 .565 
MN(4; 1, 1; 4, —4) 058 .273 
MN(4; 1, 1; 1, —1) . 167 .505 
MN(4; 1, 1; 3, -) .627 .912 
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that the combination of an inspector and a Lot Plot diagram will yield 
a test of normality less sensitive than the second of these and probably 
less sensitive than either. If this be granted then, from a study of either 
Table 1 or 2 we reach two conclusions. 


1. The Lot Plot Plan, relying on a 50-obser.‘ation histogram to 
identify non-normality, cannot be relied upon to protect from dis- 
tributions as nearly normal as these. 

2. Distributions as nearly normal as these will very often be regarded 
as normal, and it is worthwhile to see what sorts of decisions will 
be made when the normal analysis is applied to them. 


EF¥YECTS OF USING THE “NORMAL ANALYSIS” 


A. General remarks. We now concern ourselves with examining the 
results of an acceptance procedure which accepts if both 


7 f 
#4-3— and #-—3— 
d, ds 


lie between the specification limits, and which otherwise rejects. Since 
the Lot Plot is sometimes used with the sample standard deviation 
rather than #/d, for the estimate of o, the problem will also be investi- 
gated for that case. The behavior of these procedures will be examined 
for the normal distribution and a few members of four types of non- 
normal distributions (which first appeared in the last section). These 
four types all share the property that the normal distribution is a 
limiting case. As the parameter in either the ¢ or I distribution increases 
the snape of the distribution increasingly resembles the normal; 
similarly, as the two parameters of the symmetric Beta distribution 
increase it goes to the normal. Finally, as the parameters pu; and ye tend 
to zero in the mixed normal with o;?=o;?, the distribution tends to the 
normal. In addition, all of these distributions except for the extreme 
mixed normal are “bell-shaped,” and all of them except for the Gamma 
distributions are symmetric. We have aiready seen that if indeed the 
distribution of the lot is one of these types there is a great probability 
that it will be subjected to the normal analysis. 

In a certain sense’ it can be said that the plan “aims ”to accept if the 
points u+3¢ and u—3e both lie within the specifications, and other- 
wise to reject. Now it is clear that “3-sigma” criteria mean very differ- 
vat things for different distributions. Table 3 displays the probability 

5 If £+ 3s be used, as the sample size becomes large the probability tends to one that the lot will be 


accepted or rejected as u + 3e both lie (strictly) within the specification or one at least lies (strictly) 
outside. 




















96 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 











’ TABLE 3 
PROBABILITY LYING BEYOND (u—3¢, 4 +3c) FOR VARIOUS 
DISTRIBUTIONS 
DisTRIBUTION PROBABILITY 
Normal .0027 
ts .0117 
ts .0085 
tis .0062 
B(3, 3) 0. 
B(10, 10) .0009 
r(3) 0. below 
.0103 above 
r(8) 0. below 
.0071 above 
r(15) .0000 below 
.0054 above 
MN(4; 1, 1; 4, —4) .0022 
MN(4; 1, 1; 1, —1) .0006 
MN(4; 1,1; 4, -) -0000 








lying beyond u4+3e and u--3e for each of several non-normal distri- 
butions. From perusal of these values we would guess that as #+3s or 
£+3(#/d.) were successful in approximating » +3, so also would lots 
of identical quality tend to be accepted with widely different prob- 
abilities, depending upon the shape of the distribution. 

B. The case of normally distributed product. Where the product is 
normally distributed the joint distribution of 2, s is, of course, exactly 
known and numerical integration enables the probability of acceptance 
to be evaluated for any relation of » and ¢ to the specification limits 
U and L. As is well known [11], the probability depends not only on p, 
the fraction defective, but on how the fraction defective is divided; if 
p/2 lies beyond each specification, then the probability of acceptance 
is greater than if all of it lies beyond one specification. (Variables plans 
exist for which the O.C. effectively depends on p alone [2, 9]; the Lot 
Plot Plan is inferior to them in this regard). Figure 5 shows the O.C. 
as a function of p for various relations of u to the specification limits. 

If Z and # be used, then the exact distribution of 7 /3 not available. 
However, an approximation due to Patnaik [8] is kr-~n1 to be ex- 
cellent. Further, # and 7 are independently distributed in the normal 
ease. Thus the O.C. can be obtained by numerical integration. 

In either of the above cases another method of approximately 
evaluating the O.C. is available. We know that Z is exactly normally 
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TABLE 4 


VALUES OF p FOR WHICH PROBABILITY OF ACCEPTANCE IS 
EQUAL TO .95, .90, .10, .05, CALCULATED FOR RANGE AND 
STANDARD DEVIATION PLANS BY BOTH EXACT AND 
APPROXIMATE METHODS 





























PropaBmiry Onwz-Sipep Two-Sipep 
or Acczpt- ; * 
ANCE 05 | .10 | 90 2 | .08 | .10 .90 95 

Exact .0086*| .0060*) .00024*| .00013*) .0116 | .0081 | .00034 00019 
Range Normal 

Approximation! .0084 | .0057 | .00022 | .00012 | .0115 | .0078 | .00031 | .00018 
Standard Exact .9073 | .0052 | .00032 | .00020 | .0098 | .0070 | .00046 00030 
Deviation | Normal 

Approximation} .0071 | .0051 | .00030 | .00019 | .0101 | .0070 | .00046 00032 














* Patnaik approximatic a. 


distributed; 7, being the average of ten independent ranges, may be 
considered approximately normally distributed, and s the sample 
standard deviation is asymptotically normally distributed. Thus we 
may consider taking #+3s and #—3s to have a bivariate normal dis- 
tribution. Similarly we may consider taking z+ 1.37 to have a bivariate 
normal distribution. Table 4 shows the values of p for which the prob- 
ability of acceptance (from normal lots) equals .95, .90, .10, .05 as 
calculated from the bivariate normal approximation, and by the 
exact method for s, by Patnaik’s approximation for 7. The agreement 
is seen to be good. 

C. Methods used for case of non-normal distributions. When product 
is not normally distributed the exact distributions of #, s and # are all 
unknown for the distributions considered here. This means that either 
approximation to these distributions or empirical sampling must be 
used to evaluate the probability that the lots limits (using either s or 7) 
will lie inside the specification limits. The course chosen in this investi- 
gation was to take the lot limits as having bivariate normal distribu- 
tion. It has already been seen that this approximation is excellent for 
the normal, and it can fairly be hoped that it will not lead to grossly 
misleading results for these distributions which are “nearly” normai. 

To apply the bivariate normal approximation it is necessary to ob- 
tain its parameters. The mean and variance of # are of course » and 
o*/50, when u and o? are the mean and variance respectively of the dis- 
tribution postulated for the lot. The mean and variance of rs, the 
range of five observations, were obtained by numerical evaluation on 
the CPC of the integral expressions (given in [5, p. 223]) which are, 
respectively: 
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Ber) = f 1 - a - Few) - (Pw))*Iau 


E(rs)? = 2 f f { ([F(u) — F(v)]§ — [1 — F()]* — [F(u)]* + 1} dudvs 


In these expressions F(u) is the cumulative distribution evaluated at u. 
Values of E(rs/c) and o(rs/c) for distributions are given in Table 5. 
» TABLE 5 
VALUES OF E(rs/c) AND oa(rs/e) FOR VARIOUS DISTRIBUTIONS 








DIsTRIBUTION E(r;/c) (r/c) 








ts 2.24775 1.08931 
ts 2.29378 0.98788 
tis 2.31067 0.93279 
te 2.32593 0.86409 
a(1, 1) 2.30940 61721 
8(3, 3) 2.32105 .79568 
8(10, 10) 2.33120 .82743 
B(@, =») 2.32593 .86409 
r(3) 2.25928 .96704 
r(8) 2.29588 .91209 
r(15) 2.30895 .89157 
r(@) 2.32593 .86409 
MN(4; 1, 1; 4, —® 2.31450 .68094 
MN(4; 1, 1; 1, —1) 2.33028 .77979 
MN(}; 1, 1; 4, —4) 2.32751 .85529 
Normal 2.32593 . 86409 








It is interesting to observe how stable is the ratio E(rs/c) from distri- 
bution to distribution. 

In the case of symmetric distributions the correlation between the 
range and the sample mean is zero since they are, after translation, 
even and odd functions, respectively. For the Gamma distributions the 
covgriance between the mean and range of five observations was ob- 
tained by numerical evaluation of the following integral expression: 


(20p + 24)(2p + 1)! 
2*9+2p!p! 





E(srs) = 


= (60p + 72) f F(2)(1 — F(@))af(a)*de 





* The final term of the integrand in Kendall's expression (9.58) should be preceded by a plus sign; 
it is erroneously printed with a minus sign. 
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when the Gamma density and cumulative function are, respectively: 


1 
f(z) =l—lte*x 
Pp! 


1 £ 
F(z) = — f e~*tPdt. 
pido 


In terms of these parameters of the joint distribution of Z and 7, the 
bivariate normal approximation to the joint distribution of the lot 
limits is to take: 


E50 + 1.375 — 5 1.3E (rs) 








/2 PGK” Se, | Mao 
area J —— COV,(2 
50 10 10 " 


— 1.3%; — w» + 1.3E(rs) 


a + 1.69 = = (2srs) 
10 co 5 








as having a bivariate normal distribution with means zero, variances 
one, and correlation coefficient: 
o : 
— — 1.69 
50 


Pu = V(e eum =) (= 6 ) 
4 d — |—cov 
50 10 hl 


All the expressions under the square root signs simplify somewhat for 
the symmetric distributions. 

A similar approximation was applied to the problem when the lot 
limits are taken as +3s and #—3s. Application of standard methods 
[5, Ch. 9] yields the following results: 


or," 








E(%) =u 


1 Me 
Pee he 
(s) o( + $y .) 
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bs 
cov (Z, 8) = 
, 8) 2No 
var = = o*/N 
ba — of 
var § = —————_-: 
4(N — 1)o? 


In terms of these parameters (approximate in the cases involving s) 
the bivariate normal approximation to the joint distribution of the lot 
limits is to take: 























ba 
eee (1 Sit : 
on tg a ae 40004 
u’ = = 
a? 9 pw—ot 3us 
50 4 (49) 
and 
~e— 948 (+= Bay <-) 
50 wt+3e(1+ waet 
Sf = 
o 9 a4 3 
CI tk 
50 4 49% ~ B0c 


as having a bivariate normal distribution with means zero and vari- 
ances one, and correlation coefficient: 


o* 9 w-—o* 


50 4.49 a 


pu'y’ = = — 
Vem) el 
50 4.49 o 50¢. 
Again there is simplification for the symmetric distributions since the 
third central moment, us is then zero. 

D. Resulis for non-normal distributions. The distribution theory 
whose development has just been sketched enables one to evaluate the 
probability that the lot limits will lie between any given lower and 
upper specification limits, L and U respectively. Since for any given 
distribution the fraction defective p is defined as the probability lying 
outside the interval L to U we can find both the value of p and the 
probability of acceptance associated with any pair of specification 
limits. 
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In the normal case it was pointed out that the probability of accept- 
ance is not a function of p alone, but also of how p, that fraction defec- 
tive, is divided between the two ends of the distribution. In general 
we should expect this condition to obtain for other distributions as 
well. Therefore, in the case of each of the symmetric non-normal dis- 
tributions taken as illustrative examples in this study two of the infi- 
nitely many divisions of p were considered : equal division of p between 
the two ends, and all at one end. In the case of the non-symmetric 
distributions there were three cases investigated: equal division of p, 
all at the left end, all at the right end. 

Table 6 presents the results obtained. The values of p leading to 
probability of acceptance (if the “normal analysis” is always used)’ 
equal .9 and .1 are shown there for several distributions, for symmetric 
and one-sided division of p, and for both “range” and “standard devi- 
ation” lot limits. 

Study of the figures in this table indicates several things: 

Use of range or standard deviation leads to nearly identical results. 

The plan is extremely “tight” for the normal distribution. _ 

The plan is far tighter for some others of these distributions (for 

example the B(3, 3) distribution lies entirely between U and L 
(#=0) and still the probability of acceptance is less than .1). 
The plan is very much less tight for some distributions (such as the 


t or the I with all the fraction defective in the upper tail). 
Skewness results in a violently strong dependence of L(p) upon the 
division of the fraction defective between the tails. 


In considering the probability that the lot limits lie between the 
specification limits as a function of p we find a great diversity, even re- 
stricting ourselves to the symmetric distributions. Instead we may 
examine the probability that the lot limits lie between the specification 
limits as a function of how wide apart they are (in terms of the lot 
standard deviation). This is done in Table 7. Here we see a rather sur- 
prising degree of uniformity. All these O.C.’s could easily be plotted on 
the same chart; in fact their 50% points all lie between .95 60/(U — 
and 1.00 6¢/(U—L). 

This observation together with a remark made much earlier helps 
to make sense of the behavior of the plan. We have just seen that the 
probability of obtaining lot limits between the specification limits de- 
pends upon the distance between the specification limits (U—L)/6¢ 
in much the same way for all these “nearly normal” distributions. On 





1 That is, the probability thai the lot limits lie between U and L. 
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TABLE 7 


PROBABILITY OF ACCEPTANCE BY LOT PLOT “NORMAL 
ANALYSIS” AS A FUNCTION OF THE STANDARD 















































DEVIATION 
6c 
DistrisvtTion U-L 
-80 85 -90 95 | 1.00 | 1.05 | 1.10 | 1.15 
ty range .94 86 -73 .60 .46 .B5 -25 17 
standara deviation .87 -78 .67 58 47 .38 -30 -23 
te range .94 -85 -72 56 -41 -28 .19 .13 
standard deviation .94 -85 .74 59 41 -28 .19 13 
te range -95 4 .69 .53 .38 25 -16 .10 
standard deviation .97 .88 -75 .58 .40 -26 15 .09 
(3, 3) range .97 .88 .73 .54 -35 .21 .12 -06 
standard deviation | .99 -95 82 .58 .34 17 .08 .03 
(10, 10) range .96 . 86 71 -52 .35 22 -12 .07 
standard deviation .99 -93 .79 .59 .38 -21 -10 -05 
MN(4, 1,134, -—® range .95 . 86 -70 -52 34 .22 13 -08 
standard deviation .98 -92 .78 57 .37 .23 .12 -06 
MN(4;1,1;1, —1) range . 96 .88 -72 .52 33 -19 -12 -06 
standard deviation | .99| .94| .79] .59] .38/ .19| .10| .04 
MN(};1,1;4 -® range .99 -92 .76 .54 .30 .17 .09 04 
standard deviation | 1.00 .96 .85 .60 .33 14 .05 .02 
Normal range 
standard deviation 





the other hand, we earlier saw that u+3e limits have greatly different 
values of p associated with them. From these two facts we could predict 
what was shown in Table 6, that the behavior of the plan in terms of 
the proportion defective depends strongly upon. the type of distribution. 

E. Estimation. When the Lot Plot looks normal, but the lot limits 
do not both lie between the specification limits, the lot is referred to 
salvage with an estimate of the fraction defective beyond each limit. 
These estimates are: 


=— e-*/4dt an =| e~* /*dt 
a V 24 J (us i2 bt a ‘ 


where 6 is either s or 7/dz. 
The question naturally arises, “What properties have these esti- 
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mates?”. The answer would seem to be that for distributions, such as 
the t, with much probability in the tails, the estimates above will tend 
to be underestimates, and for distributions with little in the tails (such 
as the Beta) the estimates will tend to be ovcrestimates. For example 
we see from Table 6 that if the distribution of product is B(3, 3) with 
actually zero fraction defective we may still accept with probability 
less than .1; the case shown in the table is for the specification limits 
at the ends of the (finite) range of the distribution. In such a case 
whenever we reject we estimate some positive value for fy or for pz, 
or for both; whenever the lot is accepted the estimate for both is zero. 
Clearly the expected value of /v+ x exceeds zero, the actual fraction 
defective. Similar reasoning confirms the statement made for long- 
tailed distributions such as the ¢. The simple fact would seem to be that 
though one can estimate u and o with fair success for many distribu- 
tions, since identical pairs of values for » and o denote greatly different 
probabilities in the tails for different distributions, we cannot hope to 
estimate these probabilities with uniform success by always pretendizz 
the distribution is normal. 


NON-NORMAL ANALYSIS 


Although, as we have seen, for non-normal distributions of our illus- 
trative types the “normal analysis” will often—or indeed, usually—be 
used, it is profitable to investigate what will be the effects of another 
type of analysis when used. 

In all, about 11 different sorts of analysis are distinguished in the 
plan. Only two of these, besides the normal analysis, receive attention 
in this study. These are the skew and multimodal cases. Since the 
special procedures in both these cases depend upon the mode it is very 
difficult to make any study of sample behavior; little if anything is 
known about the sampling distribution of the mode. However, we can 
again “look at the problem in the parameters”. If the inspector knew 
the lot was skewed (or multimodal), knew the mode(s), and knew the 
true values of cy and ez, then how would lots of various quality fare 
under the rule(s) which set the lot limits at 


rode — 3¢, and mode + 3eyv 
for skewed lots, or 
smallest mode — 3c, and largest mode + 3cyv 
for multimodal lots? Table 8 shows that the rule—if the parameters 
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were known—would lead to very different results depending upon what 
distribution, or which end of the distribution, is being considered. 

It can certainly be said that there are grounds for doubting that the 
special rules for these two cases, at least, will go far toward making the 
operating characteristics of the plan independent of the distribution of 
of the lot. 


t 


TABLE 8 


PROBABILITY ABOVE UPPER MODE+3eyv AND BELOW LOWER 
MODE-—3ez, FOR VARIOUS DISTRIBUTIONS 














DISTRIBUTION PROBABILITY ABOVE | PROBABILITY BELOW 
r(3) -0050 0. 
r(5) -0036 -00005 
r(8) -0030 -00023 
MN*(3; i, 1; 4, —%) -00067 -00067 








* MN(4; 1, 1; », —x) is unimodal for » $1. 


SUMMARY AND DISCUSSION 
The results can be summarized as follows: 


1. The Lot Plot Plan fails—as must any 50-observation plan—in its 
objective of being uniformly tight regardless of lot distribution. 

2. The &-observation histogram will usually judge to be normal 
lots which in fact have fractions in their tails appreciably larger, 
or appreciably smaller than has the normal. 

3. The probability that lot limits will lie between the specification 
limits depends upon the distance (in standard deviation units) 
between the specification limits in a way which is not strongly 
dependent upon the lot distribution; but (almost as a result) the 
probability of acceptance depends very differently upon the 
fraction defective for different distrivutions. (That value of the 
fraction defective for which the probability of acceptance is .9 
varies for cases considered between 0, for B(3, 3), and .006, for 
I'(5) ; fractions defective with 10% probability of acceptance range 
from 0, for B(3, 3), to .045, for attributes plan n= 50, c=0). 

4. The procedures for estimating fraction defective will lead to sys- 
tematic overestimates for some distributions, systematic under- 
estimates for others. 

| 5. The procedures for dealing with non-normal distribution are 
questionable. 
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Since many of these conclusions apply entirely or in some degree to 
any variables plan it is fair to ask, “How does the Lot Plot Plan compare 
with other variables plans?”. 

It is clear that when all lots are normal, existing plans are superior 
for several reasons. The lot plot procedure will unfortunately from time 
to time lead to treating a sample as coming from some other kind of 
distribution—say bimodal; existing variables plans will routinely 
treat all samples by those methods which are for the normal distribu- 
tion the optimum ones. Further, plans which give ene probability of 
acceptance for each fraction defective regardless of how it is divided 
between the two ends of the distribution are preferable on those 
grounds alone. 

If lots are not normal then either Lot Plot or other existing variables 
plans may behave in strange ways. When the user requires “iron-clad pro- 
tection” regardless of the lot distributien, an attributes plan is needed. 

Finally, there is a great inflexibility in approaching every acceptance 
sampling problem with the same decision procedure—“Take 50 obser- 
vations at random, and .. . ”. Different problems may legitimately call 
for different procedures. This option is not a part of the Lot Plot Plan 
as it is of Mil Standard 105a, or NavOrd 80 or other standard plans 
which give choice of AQL and sample size. , 

Despite the various shortcomings of the plan which have received 
emphasis here, it has undoubtedly been successful in many installations. 
What features of the plan may have contributed to its success? There 
is no doubt that the plan has a very definite psychological appeal aris- 
ing from literally “seeing a picture” of the sample. The provision for 
always taking the same size sample is administratively (and psycho- 
logically) convenient, however inadequate it may be from some points 
of view. 

If lots are usually normal and the user actually needs a very tight 
plan it is not a bad approximation to existing good unknown standard 
deviation plans, and the usual benefits of well chosen variables plans 
will largely accrue to the Lot Plot user under these circumstances. In 
cases of gross bimodality, for example, the histogram will give definitely 
useful information about the process which generated the lot. 

Finally, the provision that though an inspector may accept the lot 
it requires the Salvage Review Board to dispose of one in any other 
way should have important administrative advantages in many set- 
tings: economic fdctors will tend to be systematically weighed at the 
time of disposition of the lot; if the Salvage Review Board contains 
representation from the departments which (in addition to inspection) 
are concerned, then fewer dispositions of lots should result in difficulties 
such as production flow problems. 
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THE OPERATING CHARACTERISTIC CURVE FOR 
SEQUENTIAL SAMPLING BY VARIABLES WHEN 
THE PRODUCER’S AND CONSUMER’S RISKS 
ARE EQUAL 


Norman R, Garner* 


Thiokol Chemical Corporation 


HENEVER the prod:cer’s risk, a, is equal to the consumer’s risk, 

8, the operating characteristic curves for sequential sampling 

can be reduced to one curve. This applies to sequential sampling to test 

if the mean of a normal distribution with known standard deviation 

falls short of a given value. Briefly, the pian is such that there exist a 

6) and 6; (@.<6,) such that the probability of rejecting a lot whenever 

66 is at most a, and the probability of accepting a lot whenever 

626, is at most 8, where @ is the mean value of the lot being tested for 
a particular characteristic. 

Let a=, then the lot is accepted whenever 


dX S a, = n(s) — In, 
rejected whenever 
DX = ra = n(s) + hi, 
and testing is continued whenever 
an < 4 < Ta, 
where >_X is the sum of n observations, 


Oo + 
2 


l-—a o 
hy = In( ) é 
a 0, — 9% 


For this plan the operating characteristic, L(6), is given by 
A*(A* — 1) 

A*—1] 
* Now at the Naval Powder Factory, Indian Head, Maryland. 
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where 
, l-ea 
A= 
a 
and 
s-— 86 s—0 
|] = eft *) 
& — 4% A-8 














Thus it is easy to see that |h| is symmetric about s, h is negative 
when 6>s, and positive when @<s. 

It can be shown by algebraic manipulation, that 

1 — L(6) 
—hinA = In a). 
L(@) 

Thus we can compute values of —h In A for different values of L(é), 
and because of the symmetrical property of | h| we get the operating 
characteristic curve shown in Fig. 1. Then, once a= is specified, we 
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Fia. 1. Operating Characteristic Curve for Known Sigma Sequential Sampling 
by Variables with Equal Risks 
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easily obtain the operating characteristics in terms of |h|. Finally, 
after #) and @, are specified the operating characteristic curve is ob- 
tained in terms of @ since 


6=s—|h|(s— 6) for 0<s, and 
6=s+  |h|(s— 6) for O><«. 
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TABLE OF PERCENTAGE POINTS OF 
KOLMOGOROV STATISTICS 


Lesiie H. MrituEeR* 
Ohio State University 


A simple method for testing the probability that a set of 
numbers is a sample from a known distribution consists of 
comparing the empirical cumulative distribution function of 
the sample, S,(z), with the known cumulative distribution 
function F(z). Both D,=maximum{S,(z) —F(z)} and D,* 
=maximum| S,(z) —F (x) | are ranciom variables, independent 
of the special form of F(z), if F(x) is continuous. This paper 
contains more extensive tables of the percentage points in the 
distributions of D, and D,* than have been published previ- 
ously. These values are obtained by empirical modification of 
a known asymptotic formula. 


1. INTRODUCTION 


comes from the cumulative distribution function F(x), we may use 
either the statistic 


I. ORDER to test whether a sample of nm numbers (2, 22, - ~~: , Zn) 


D, = maximum {S,(z) — F(z)}, 
or the statistic 
D,* = maximum | S,(z) — F(z)|, 


where S,(x) is the sample cumulative distribution function. 

As an illustration, suppose that we wish to test the hypothesis that 
the five numbers .52, .65, .13, .71, and .58 were chosen at random 
from numbers uniformly distributed between zero and one. Then 
S;(x) has the value of zero between +=0 and z=.13. At x=.13, S(x) 
is discontinuous, since its value jumps from 0.0 to 0.2. It is easy to 
sketch the step-function, S;(x), and compare it with F(z), which is z 
for a uniform distribution between 0 and 1. 

The maximum difference must occur at one of the jump points of 
S,(z). If we tabulate the data of the above illustration, we find that 
Ds=.29 and D,* =.32. 





* This study was supported in part by funds granted to Ohio State University by the Research 
Foundation for aid in fundamental research. The author acknowledges the assistance of Nelson Prentiss, 
Vincent Donato, Milton Glans, and Richard Thomas in carrying out numerical computations. 
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At right end point 
Interval S;(z) {Ss(z)—2z} | Ss(x) —2| 





00 <z<.13 R + .07 * 

-138<z< .52 ‘ —.12 ‘ D;* = .32 
.562 <2 < .58 ‘ + .02 

.58 <2 < .65 “ +.15 > 

.65 <2<.7) . + .29 p D,= .29 
71<2<1.0 g .00 


We shall see that the number .44698, found in Table 1 for n=5 and 
a=.10, means that we expect D; to exceed .44 more than 10 per cent 
of the time when a sample of 5 numbers is picked from a uniform dis- 
tribution. §'-ce D;=.29 in our example, we cannot, at the 10 per cent 
level, rejec. the hypothesis that the numbers belong to a uniform dis- 
tribution. Likewise, the number .50945 found in Table 1 for n=5 and 
P=.90, means thrt D,* will exceed .50 about 10 per cent of the time 
when samples of * numbers are picked from a distribution with the 
continuos cumulative distribution function F(z); 

When (2), 22, - - +, %,) are mutually independent and all come from 
the same c:.utimuous distribution function F(x), the distribution of D, 
does not depend on F(z). This means that a table used to test the 
hypothesis that numbers come from a uniform distribution may also 
be used to test the hypothesis that numbers come from a normal dis- 
tribution, «r fvora any completely specified continuous distribution. 

The statisti: D, can be used to test the hypothesis that the observa- 
tions come fru... F(z) against the alternative that they come from G(z). 
If it is assumed that G(x) = F(x+@) where O>0, i.e., G(x) is the same 
as F(x) except shifted to the left, then a test based on large values of 
D, would be a reasonable procedure. For @ <0, small values of D, would 
be appropriate, or large values could be used by replacing observations 
by their negatives and replacing F(x) by 1—F(—z). The statistic D, 
can be used for testing more general hypotheses, such as G(x) 2 F(z) 
for all z. 

Let a= Prob. (D, 26). Table 1 contains corresponding values of ¢ for 
a=.10, .05, .025, .01, and .005, for values of n from 1 to 100. 

Let P= Prob. (D,* Se). Table 1 also contains values of ¢ correspond- 
ing to P=.90, .95, .98, and .99. For n=5 and P=.98 the number 
¢=.627 ha» the following interpretation. If five numbers are picked at 
randcm, irom a uniform distribution on the interval (0, 1), then it is 
reasonable to expect that about twice in 00 trials the associated sample 
cumulative ¢istribution func‘ion will cut either the line 2+.627 or the 
line 2--.627. 
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TABLE 1 


PERCENTAGE POINTS IN KOLMOGOROV-SMIRNOV STATISTICS 
Values of ¢ for which a =Prob. (D, 2) and P=Prob. (D,* Se) 








a=.10 a= .05 a= .025 a=.01 a= .005 
(P = .90) (P = .95) (P = .98) (P = .99) 





- 90000 - 95000 .97500 - 99000 - 99500 
-68377 - 77639 -84189 - 90000 
56481 -63604 . 70760 . 78456 
.49265 .56522 .62394 -68887 
.44698 50945 -56328 .62718 
-41037 -46799 .51926 5/741 
.38148 . 43607 .48342 .53844 
-35831 -40962 -45427 -50654 
.33919 - 38746 -43001 -47960 
. 32260 - 36866 .40925 -45662 


CoOnNaourwnd 


. 30829 .35242 .39122 .43670 
- 29577 .33815 .37543 .41918 
. 28470 -32549 .36143 .40362 
-27481 .31417 .34890 .38970 
- 26588 -30397 -33760 .37713 
.25778 . 29472 .32733 .36571 
- 25039 - 28627 -31796 .35528 
. 24360 -27851 30936 .34569 _ 
-23735 . 27136 .30143 . 33685 
-23156 . 26473 . 29408 - 32866 


. 22617 - 25858 . 28724 .32104 
-22115 . 25283 . 28087 .31394 
- 21645 . 24746 . 27490 .30728 
.21205 . 24242 . 26931 .30104 
. 20790 - 23768 - 26404 - 29516 
- 20399 - 23320 . 25907 - 28962 
- 20030 . 22898 - 25438 | . 28438 
. 19680 . 22497 . 24993 . 27942 
. 19348 .22117 -24571 -27471 
- 19032 -21756 .24170 . 27023 


. 18732 .21412 . 23788 - 26596 
. 18445 . 21085 . 23424 - 26189 
.18171 .20771 . 23076 - 25801 
. 17909 . 20472 . 22743 . 25429 
. 17659 - 20185 22425 . 25073 
.17418 - 19910 .22119 . 24732 
- 17188 . 19646 - 21826 . 24404 
. 16966 - 19392 .21544 - 24089 
- 16753 - 19148 .21273 . 23786 
- 16547 - 18913 -21012 - 23494 
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TABLE 1—(continued) 








¥ a=.10 a= .05 a= .025 a= .01 a= .005 
(P = .90) (P = .95) (P = .98) (P = .99) 
41 . 16349 . 18687 . 20760 . 23213 . 24904 
42 .16158 . 18468 . 20517 .22941 . 24613 
43 .15974 .18257 . 20283 . 22679 - 24332 
44 . 15796 . 18053 . 20056 . 22426 . 24060 
45 . 15623 . 17856 . 19837 .22181 . 23798 
46 . 15457 . 17665 . 19625 . 21944 . 23544 
47 . 15295 17481 . 19420 -21715 . 23298 
48 .15139 . 17302 .19221 . 21493 - 23059 
49 . 14987 .17128 . 19028 .21277 . 22828 
50 . 14840 . 16959 . 18841 .21068 . 22604 
51 . 14697 . 16796 . 18659 . 20864 . 22386 
52 . 14558 . 16637 . 18482 . 20667 .22174 
53 . 14423 . 16483 .18311 . 20475 - 21968 
” 64 . 14292 . 16332 .18144 . 20289 . 21768 
55 . 14164 . 16186 .17981 . 20107 . 21574 
56 . 14040 . 16044 .17823 . 19930 . 21384 
57 .13919 . 15906 . 17669 .19758 .21199 
58 .13801 .15771 .17519 . 19590 .21019 
59 . 13686 . 15639 .17373 . 19427 . 20844 
60 . 13573 -15511 .17231 . 19267 . 20673 
61 . 13464 . 15385 .17091 .19112 . 20506 
62 - 13357 . 15263 . 16956 . 18960 . 20343 
63 . 13253 .15144 . 16823 . 18812 . 20184 
64 .13151 .15027 . 16693 . 18667 . 20029 
65 . 13052 .14913 . 16567 . 18525 . 19877 
66 . 12954 . 14802 . 16443 . 18387 .19729 
67 . 12859 . 14693 . 16322 . 18252 . 19584 
68 . 12766 . 14587 . 16204 .18119 . 19442 
69 . 12675 . 14483 . 16088 .17990 . 19303 
70 . 12586 . 14381 .15975 .17863 .19167 
71 . 12499 . 14281 . 15864 .17739 . 19034 
72 .12413 . 14183 .15755 .17618 . 18903 
73 . 12329 . 14087 . 15649 .17498 .18776 
74 . 12247 . 13993 . 15544 .17382 . 18650 
75 . 12167 . 13901 . 15442 . 17268 . 18528 
76 .12088 .13811 . 15342 .17155 . 18408 
77 .12011 . 13723 .15244 . 17045 . 18290 
78 -11935 - 13636 .15147 . 16938 .18174 
79 . 11860 .13551 . 15052 . 16832 . 18060 
80 .11787 . 13467 . 14960 . 16728 .17949 
81 .11716 . 13385 . 14868 . 16626 . 17840 


82 -11645 - 13305 .14779 - 16526 .17732 
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TABLE 1—(continued) 





on. ao a= 025 ‘an 0 a= .005 
(P=.90) (P=.95)  (P=.98) (P=.99) 





-11576 . 13226 - 14691 - 16428 - 17627 
-11508 - 13148 - 14605 - 16331 . 17523 
- 11442 -13072 - 14520 - 16236 .17421 
-11376 . 12997 - 14437 - 16143 -17321 
-11311 - 12923 - 14355 - 16051 . 17223 
- 11248 - 12850 . 14274 - 15961 - 17126 
-11186 -12779 14105 - 15873 -17031 
-11125 - 12709 -14117 . 15786 . 16938 


- 11064 - 12640 - 14040 - 15700 . 16846 
- 11005 . 12572 - 13965 - 15616 - 16755 
- 10947 - 12506 - 13891 - 15533 - 16666 
- 10889 . 12440 - 13818 . 15451 - 16579 
- 10833 - 12375 . 13746 - 15371 - 16493 
-10777 . 12312 - 13675 - 15291 - 16408 
- 10722 . 12249 - 13606 -15214 - 16324 
- 10668 . 12187 - 13537 - 15137 - 16242 
-10615 . 12126 - 13469 - 15061 -16161 
- 10563 . 12067 - 13403 . 14987 - 16081 





2. FORMULAS ASSOCIATED WITH TABLE 1 


a = Prob. (D, 2 «). (1) 


It was shown by Birnbaum and Tingey [3] that values of n, a, and « 
which satisfy (1) are connected by the relation 


HEME a 


where (7) is the binomial coefficient and [n—ne] is the largest integer 
contained in n(1—e). They tabulated ¢ for n=5, 8, 10, 20, 40, 50 and 
a=.10, .05, .01, .001, and compared these solutions of (2) with values 
given by the asymptotic formula 


(3) 


derived by Smirnov. 
It is also well known that the random variable D,* has a probability 
distribution independent of F(x) if F(z) is continuous. The limiting 














116 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


distribution of D,* was found by Kolmogorov. Feller [6] has given sim- 
plified derivations for the limiting distributions of D, and D,*. 
Let 


P = Prob. (D,* & «). (4) 
Tables of P for n=1, 2,---, 100 and e=j/n, j=1, 2,---, 15 have 
been constructed by Birnbaum [2], and for n=5, 10, 15, - - - , 80 and 


e=j/n for selected values of 79 by Massey [7]. Both Birnbaum [2] 
and Massey [8], by inverse interpolation, obtained tables of «, cor- 
responding to given values of n, for certain conventional probability 
levels P. 

Table 1 contains values of ¢ which satisfy (1) for a=.005, .01, .025, 
.05, .10 and for n=1, 2, - - - , 100. For n$20 the values of « are com- 
puted solutions of equation (2). For n>20 the values of ¢ in Table 1 
are given by the formula 


e*(n, a) = é(n, a) — .16693n-! — A(a)n-*/? (5) 


where é(n, a) is the value given by Smirnov’s asymptotic forraula (3), 
and A(.10) = .00256, A(.05) = .05256, A(.025) =.11282, A(.01) =.20562, 
A(.005) =.28464. Formula (5) is a special case of (11) which is dis- 
cussed in section 4. 

Evidence discussed in sections 5 and 6 indicates that, for a3.05, 
the values of e shown in Table 1 may also be regarded as values which 
satisfy (4) for P=1—2a. 


3. COMPUTATIONAL TECHNIQUES 


Finding a value of e which satisfies (2), for specified values of n and 
a, involves finding a real root of a polynomial equation of degree n. 
To solve this equation efficiently it is desirable to have methods for: 
(a) making a good first approximation; (b) assuring the rapid conver- 
gence of subsequent approximations; and (c) checking to eliminate 
numerical errors. 

Formula (5) may be used to make a first approximation for a solution 
of (2). The substitution of an approximation for ¢ in (2) requires the 
computation of 


Ada, = (7) (1 — ¢—j/n)“(e + j/n) 6) 


which in the present study was evaluated using 7, 8, or 10 place 
logarithms. 
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For fixed n, the rate of change of a with respect to « can be expressed 
in the form 
da wodsy j-1 n-j 
oo (— PS tse Asn a-- @ 
de joo NE e+j/n 1—e-—j/n 
Formula (7), which can be evaluated without additional use of loga- 
rithms after the products in (6) have been found for a given value of 
¢, provides a method for obtaining a convergent sequence of approxi- 
mate solutions of (2). 
The differential relation 








2a = t ) anode) 
e+j/n 1—e—j/n 
may be used to approximate A ;(n, e+de) when A;(n, €) is known. It is 
useful in checking two computations for slightly different values of « 
and can be used both to locate numerical errors and to estimate the 
number of significant digits in computed results. 

The formula 


asin, = ( 


Asia, 1/s) one (9) 
j~ , € = - j\ ey 
= (n--j+ (nets) ” 
is useful when evaluating (2) for values of e which differ by 1/n. 

A more detailed discussion of computational methods used and com- 
puted solutions of (2) not included in tl.e present paper are included in 
an unpublished master’s thesis by Nelson Prentiss [9]. 





4. DERIVATION OF A MODIFIED ASYMPTOTIC FORMULA 


Using techniques outlined in section 3, numerous values of n, a, «€ 
which satisfy (1) were found with values of a and « correct to seven 
decimal places. Then attempts were made to modify (3) to obtain a 
formula which agreed closely with computed values of ¢ for a specified 
range of values for n and a. This general approach has been used 
previously. 

Modified asymptotic formulas were used by Campbell [4] and by 
Hotelling, Fisher, and Riordan [10]. Aroian [1] used empirically 
modified asymptotic formulas in finding percentage points of the chi- 
square distribution. 

The starting point in the development of (5) was the conjecture 
that, for fixed a, the ratio r=«/é, with e« and é given by (1) and (3), 
might be smooth for fairly small values of n. This was first tested for 
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the fixed probability level a=.10. Table 2 contains computed values 
of « and é along with the ratio r, for a=.10 and for n=20, 25. 30, 35, 
40, 45, 50, 80, 100. The ratio r=«/é was approximated by a function of 
n to produce the formula 


1 log.10 
e(.1) = (1 -— ) . G9) 
6.44245./i — .08892 logis n — .03422 2n 





Values given by (10) are shown in Table 2. 


TABLE 2 


EXAMPLE OF A MODIFIED ASYMPTOTIC FORMULA FOR A 
FIXED PERCENTAGE LEVEL 











! 
n a ‘ é r=e/é «(.1) 
Formula (2) | Formula (3) : Formula (19: 
20 10 . 2315554 . 2399263 .9651105 . 2315554 
25 10 . 2079016 . 2145966 . 9688019 . 2079017 
30 .10 . 1903213 . 1958990 .9715277 . 1902212 
35 10 . 1765872 . 1813672 .9736446 . 1765872 
40 .10 . 1654716 . 1696535 .9753503 .1654717 
45 .10 . 1562342 . 1599509 .9767635 . 1562342 
50 .10 . 1483981 . 1517427 .9779587 . 148398? 
80 -10 . 1178739 . 1199631 . 9825846 .117873« 
100 .10 . 1056273 . 1072983 .9844266 -1056274 




















The methods used to obtain (10) can be employed to find a:. ap. 
proximate formula for any specified value of a. It is also possible to 
develop a single formula which can be used for all a in some interval, 
such as .005Sa3.10, but such a formula, in general, would rot be 
expected to be so accurate as (10) for a=.10. One such formula is given 
below 


e*(n, a) = en, a) — .16693n-' — A(a)n-*/? (11) 
where 
A(a) = .09037(—logio a)*/? + .01515(logio a)? — .08467a — .11145 
and é(n, a) is the value given by (3). 
The accuracy of (11) is indicated by Table 3 where each value of 
a shown was obtained by substituting the corresponding values of n 


and ¢ in (2), and each value of «* was obtained by substituting n and 
a in (11). 
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5. A RELATION CONNECTING P AND @ 


Using the notation of section 1, let 8 be the probability that t’. step 
polygon S,(<) will cut both of the bands F(z) +e and F(z) —e. It follows 
that, for fixed n and «, 


P=1-—2a+8. (12) 


Since (2) provides an exact formula for a, (12) may be used to find P 
when £ is known and conversely. 
If the approximation 


P* = 1 — 2a (13) 
is used as a value for P, the error cannot be greater than an upper 


bound for the corresponding value of 8. For any n less than 100 and 
for any «, 0<¢<1, a bound for the corresponding value of 8 can be 

















TABLE 3 
SPOT CHECKS OF THE ACCURACY OF FORMULA (11) 
a é* 
- Formula (2) . | Formula (11) P-P* 
20 .0688270 .25 | . 249997 .00002 
20 0215335 .30 . 299998 .00000 
20 0053775 .35 350007 .00000 
20 0010595 40 .400037 .00000 
50 . 1282836 14 139997 .00049 
50. 0692174 16 . 160000 .00003 
50 | 0343888 18 . 180000 .00000 
50 | .0157195 .20 . 200000 .00000 
50 0066052 .22 . 220000 .00000 
50 .0025487 24 . 240000 .00000 
50 | 0009021 .26 . 260000 .00000 
100} . 1265906 .10 099999 .00049 
100 | —- 0825244 ll . 109999 .00008 
100 .0516583 .12 . 120000 .00002 
100 .0310470 13 . 130000 .00000 
100 0179127 14 140000 00000 
100 | .0099196 15 | 150000 00000 
100 0052718 .16 160000 — 
100 0026882 17 .170000 | — 
200 . 1002216 075 074999 — 
200 | 0524240 .085 085000 — 
200 | 0253058 .095 095000 — 
200 .0112640 105 105000 —_ 
200 0046232 115 . 115000 — 
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obtained from the table of values of P given by Birnbaum [2]. For 
example, Table 1 of the present paper contains the entry n=12, 
e=.33815, P= .90, which was obtained by (13) using a =.05. 

To find a bound for the error in the approximation, consider the 
entry in Birnbaum’s Table for n = 12 and for the first value of P smaller 
than .90. This is P =.89126 which corresponds to e=c/n=}. If e=} is 
substituted in (2) for n=12, the corresponding value of a is found to 
be .054372. Substitution of this value in (13) gives P* = .891256 which 
agrees to five decimal places with Birnbaum’s value of P. Since £ is a 
decreasing function of ¢, the error in P is less than .00001 for each 
value of ¢ in Table 1 corresponding to n=12 and P=.90, .95, .98, .99. 

For small values of n, and in certain cases for general values of n, 
it is easy to find an exact expression for 8. For example, if n is an even 
integer 


B=0, i/2seS1, 


"|| 
g = (1-291 4—|~ > = a 
2° 2 
where (n n/2) is the binomial coefficient. 


6. ACCURACY OF TABLE | 


The author believes that the greatest error in Table 1 does not 
exceed one unit in the fifth decimal place. For n $20, the values of « 
shown in Table 1 are solutions of (2), computed to seven decimals be- 
fore rounding off to five places. 

For n> 20 values of ¢ were obtained from (11). Table 3 indicates that 
this formula approaches five decimal accuracy for n = 20, for the range 
of a used in Table 1, and is a better approximation for n = 50, 100, and 
200. Formulas (10) and (11) were obtained by completely independent 
methods, yet they agree to five decimal places for a =.10 and 
20sn3100. 

The method illustrated in section 5 can be used to show that the 
error in using (13) to approximate P is negligible over the range on 
which the formula was used. The first 17 values of n and e which are 
shown in Table 3 correspond to entries in Birnbaum’s Table [2], where 
c=ne. In Table 3, P—P* is the difference between the value for P, 
found by Birnbaum, and the value for P* given by (13). 

Additional empirical evidence of this type is contained in a master’s 
thesis by Vincent Donato [5]. For each integer 20<n<100 he chose a 
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value of P given by Birnbaum [2] with P in the range .875 <P <.995. 
He substituted the corresponding values of » and ¢ in (11) and found 
a by inverse interpolation. He substituted this value of a in (13) to 
produce P* as an approximation for the P found by Birnbaum. The 
difference P — P* was .00002 in one case, .00001 in 17 cases, and .00000 
in 61 cases. The close agreement in this limited range supports the 
conjecture that for any n, 8 Sa? which is a special form of a conjecture 
of Wald and Wolfowitz [11]. 

Relatively few of the values in Table } are included in papers 
previously published. The agreement is excellent except for several 
values of P for small sample size. For example for n=10 and P=.99, 
Massey gives «=.490, Birnbaum gives «=.4864, and Table 1 shows 
e=.48893. Massey and Birnbaum found these values by inverse inter- 
polation. For n=10, and .45<e<1, formulas (14), (12), and (2) 
provide a tenth degree polynomial which can be used to find eas ac- 
curately as desired. For P=.99, e=.48893 is the solution of this equa- 
tion, correct to five decimal places. 
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MULTIPLE REGRESSION WITH MISSING OBSERVATIONS 
AMONG THE INDEPENDENT VARIABLES* 


Groras L. Epeerr 
Queen’s University 
1. INTRODUCTION 


SAMPLE from a multivariate norma] population is such that not all 
the individuals are observed with respect to all the variates and it 
is desired to estimate the population parameters from the sample. The 
sample data may be incomplete for various reasons: the cost of collect- 
ing the data may make it inadvisable to collect all of it; the collection 
of part of the data may be impossible. This latter case would occur in 
the mental testing of a large group where part of the group have moved 
away before all the tests have been given. 

Wilks in 1932 [3] and Matthai in 1951 [1] published papers on this 
subject. Wilks considered a sample from a bivariate normal population. 
Matthai considered the general multi-variate normal population and 
carried through the discussion for the trivariate case. Both Wilks and 
Matthai found maximum likelihood estimators of the population 
parameters but in neither case found solutions in closed form for the 
maximum likelihood equations. 

In this paper we find maximum likelihwed estimators for the pa- 
rameters of a trivariate normal population where some of the sample 
observations for one of the variates are missing. For this special, but 
not unusual, case we find explicit solutions to the maximum likelihood 
equations. Finally we obtain the regression equations for the variates. 


2, SIMULTANEOUS EQUATIONS BY THE METHOD OF MAXIMUM LIKELIHOOD 
Let X:, X:, X; be random variables having the density function 


1\%2 
f(x, Za, Zs) = (-) | oii wa 


1 3 3 
exp(-=| E Dove, — mya —m]) 

2k. tnt fut 
where m;, ™2, ™s, ou, O12, ***, 03 represent the parameters in the 
density function, where m,; is the “true” mean of z; and o, is the 
“true” variance cf X; and oi=0;; is the “true” covariance of X; and 
X;. Moreover 
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71, O12 G13, —1 
(ct) = (>: 22 O23) , oi = gil 


731 O32 7G 


|o*| is the determinant of the matrix («%). 

A random sample of size N is drawn from the trivariate normal popu- 
lation given in equation (1); n of the items are observed with respect 
to X;, X2, X; and m with respect to X; and X; only, hence N =n-+m. 

Sample 

Xu Xn Xa 


Xin Xa Xue 


Kin leciaht zee 





Xiw Xay 
The joint density of observations is 
f = f(2u, 1, + * * » Zin, a1, Vo2, * * * » Von, Tai, V2, * * * » Tan | mi, 033) (2) 


where 7, 7=1, 2,3 


f = [[faltie, t20, t2a| mi, oi) [] fo(2rs, 253 | me, ons) (3) 
a=1 B=1 
where k, s=1, 3, 


1 \ 3/2 n/2 1 
f= 4) let” exp] > EX Letra — mee — m 


2r. 


—_—— 





1\™ mie 7 
times {(;-) | ot | exp| — 2 > dS Dd &*(xes — me) (243 — m,) \ 
~ , a 8 ol 


at 


(4) 


a Oi =F18 a - o 
(o**) = ( ) ; of = gt, k,s =1,3 


O31 O33 


1 3n+2n1/2 n/2 ni/2 
= (=) | ot are | t times 


Qn. 


where 


1 
exp (=f {DE Lottie =m) (ere =m) 
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+ DL Laem — my ew - mt |) (5) 
k ‘ 8 


Equation (5) which gives the sample distribution when regarded as 
a function of 210, Tea, Zsa, 21g, Lae Will now be regarded as a function of 
the parameters, m, and o;; in order to determine the maximum-likeli- 
hood estimators of m; and o,;;. When regarded ax a function of the pa- 
rameters (5) is called the likelihood function cf the parameters. Let L 
represent the logarithm of the likelihood functicn. 


+3n + 2n 








n a mM a 
L=- In 27 + In | o4| +> |™| 


1 
= 5] DD Lotitin — mre — mi) 

¢ 27 -@ 
+ ¥, > ps o* (tng — Mx) (Les a m)|. (6) 


k e <6 


We now find the equations which solved simultaneously give the 
snaximum-likelihood estimators, m:, ¢:; of m;, o;;. These equations are 
obtained by setting the partial derivatives of L with respect to m, 
Ms, M3, o!!, o?, o, a2, g!?, o respectively equal to zero. 


oL n 
— [ > {o!(tie — m) + o'*(t2q — m2) + o'*(23q — ms) } 


om, 


Te (7) 
+ >> {F(x — mi) + 7*2a3 — ma) } | 
p=i 
pom — =| *, {o!(tie ™ m,) + o*? (Lee ae m2) + a7? (Tse = Ms) | (8) 
éL = 
— YS {o'(x1e — m) + o79(22q — m2) + o**(tzq — ms)} 
= (9) 


+ 5 {?(eu — mu) + 3200 — me) |. 
Buel 


Before finding the partial derivatives with respect to o*/ we note the 
following relations. 


go}! og? | o' og? ! | o?? a3 | 
| go?) o?? o?8 a”? | " | go a3 
G82 rene | Gyetieenne sy (GO ee apiemnnns (08) 
o% go? ot? 
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We shall derive the first of these relations; the others may be obtained 
in a similar manner. Further note that ¢!? =¢7! =¢*¥ = G* ==0. 
By definition, 


a 
gu = — - 
Tu 713 | 
| 
O31 «233 | 
733 
. ot = — (10’) 
g | oes | 
Now , 
ot git} 
| | 
} o21 gt? | | gil gl 
33 = = = | Ci; |. 
| ott | |'g%t gg? 


Substituting in (10’) for o33 we obtain 


| o G4 
| gt gt? | 
gil = . 
22 


oC 
We now find dL /de*, 1, 7=1, 2, 3, making use of (10) 
OL 





1 1 1 
—(n + mon — — Do (tie — mi)? — — D (ais — ™m)*? (11) 
2 2 ‘a 2 3 











dq}! 
aL 
ac? (n + m)oi — 2) (tia — mi) (era — ma) 
og a 
A 4 (12) 
v — i (213 jo m,)? + —— : (X18 - m1) (Xs om M3) 
Az Ax ‘5 
aL 
da'3 it (n + m1) o%3 ue ¥ (2a aa m1) (Zsa — ms) 
. (13) 
— > (ais — m) (ap — ms) 
4 
aL n ny; (= Ags 1 
Ro Op i — om som L2qa — M2)? 
007? 2 " 2 \Aoe we Aw on) 2 py (22 wee 


1 A's ApAs — 
— = SBS (aug — m)* — 2 F (eis — ms) (209 — ms) (14) 
2A 22 8 A’n 8 
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1 A's 
- 3 R.4 DX (zap — ms)? 
aL f 
028 = (n + N1) 023 ~ oz (tea - Mz) (Lea _ m3) 
f ’ (15) 
Ay Ags 
waar a :H (21g — ™)(X3g — m3) — —— Le (ag — ms)? 
Ax» 8 Ax 6 
aL 1 
jo OD 9” + m)ox — — >> (aa — Ms)? — — >> (x38 — ms)? (16) 


where Pa is the cofactor of o;; in loul, i, j=1, A 3. 
We now express o*/, ** and &, in terms of a;,;’s and set the partial 
derivatives equal to zero. After some simplification (7), (8), (9), (11), 
, (16) may be put into the following form: 


[(nAyAse+n1 | oi; | o33)™,+NA 2A o9me+ (NAA 2— Mm | o.;| o13)ms| 


~*~ | Aine + Liat AwAgs >» Leat+A3Are > Ta 





(17) 
+| oi | os Do te— | os | o1 Law| = 0 
8 8 
n(A yom + Azym2 + Aagms) — | As >> LiatAr p¥ La (18) 
+Az y ne | =0 
[(nAysAo2— m1 Ci; 013) +nAv3Asome+ (nAggAve+m | ois | oi) ma] 
an | And + > Liat AwAge ; TeatAgsAn pm X30 (19) 
—_ oi; | O13 : List | a5; | ou De rm |= 
8 8 
(n+ mn )onu — > (Zia ~~ ™)? — p E (tig — m)*? = 0 (20) 
a 8 
An(n + m)ow — Ar > (tie - m) (Zea a m2) 
“ (21) 


+ Ar x (a1 — mx)? + Ass u (21g — my) (2x9 — ma) = 0 


(n + m)ois — * (Lie — ™)(Lsa — Ms) 
x (xis — mi)(2as — ms) = 0 (22) 
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nA*ogg22 — As(Asso12 + Assors) — A’sa >, (12a — Mm)? 


— ¥ [Auley =m) + Anlee — eo © 

Aa(n + m)o3 — Ag > (®2q — M2)(Tsa — Ms) 
+ Awd (ts 4 m) (ass — ma( + Aas Dd) (tap — ma)* = 0 2) 
(n + m)oss — bs (tsa — ms)? — x (ta — sal = 0. (25) 


3. THE MAXIMUM LIKELIHOOD ESTIMATORS 


The simultaneous solution of the nine equations (17) to (25) inclu- 
sive given in section two gives the maximum likelihood estimators 
My, Me, Ms, 11, G22, G33, O12, O13, 623 Of the parameters, m, m2, Ms, 11, 722, 
733, T12, C13, 723 Tespectively. We now solve these equations. 

Multiply (2.17)! by on, multiply (2.18) by ow A», multiply (2.19) 
by o:3 and add these products. After considerable rearrangement and 
making use of the fact that o::Au+o2:An+o3;:Ax equals |o,,| or zero 
depending on whether 7 equals k or not. obtain 


(n+ m)m, os > Mie + > Tig 
a 8 
y ¥ ST 


N 


(1) 





ee m, = = Zi 
where N =n+m, and the summation is over all values of 2. 
Multiply (2.17) by o13, multiply (2.18) by o23;A2, multiply (2.19) by 
o33 and add these products. After considerable simplification we obtain 
Ms = Z3. (2) 


Substitute in (2.20), (2.22) and (2.25) the values of m; and ms, ob- 
tained in (3.1) and (3.2) and obtain the maximum likelihood estimators 
of o1, 013, 033, namely. 


; (a1 — %)? 





Gu = 5 aaa (3) 
oy = Ena" : 
i- % — Zs) 
ie D(z a é 5) 





3 We mean by (&, j) the jth equation of section k. 
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where in (3), (4) and (5) the summation extends over all values of the 
variables. 

Substitute for >>, in (2.21) and (2.24) the values obtained from 
(2.20), (2.22) and (2.25), and after making use of the theorem in de- 
terminants mentioned above obtain 


Ag » (Zia — ™) (2a — M2) + Ay YY (Zia — ™m)? 
+ Ao Mi (Zia —- )(X3sa — ms) = 0 (6) 
Ag z (Zea — ™)(Laa — Ms) + Ais Le (Xie — ™)(Lsa — Ms) 


+ Ax 3; (Zsa — ms)? = 0. (7) 


! 


| on o1 
Ax ad | ’ 


713 «(33 


substitute the maximum likelihood estimators of oy, 013, ¢33 obtained in 
(3), (4) and (5) and obtain the maximum likelihood estimator of Az, 


> | Gy G43 


32 


Gis Oss ” 


T= p (tia — %,)?, TT: = :m (%sa — Zs)’, 


Tn © » (Lia as 2:) (se _ #3). (9) 


Substitute the maximum likelihood estimators of m, ms, and Ag in 
equations (6), (7) and (2.18) and after some rearranging obtain 


sf 


Ass (ne, > ria) me + TiAy + TyAss 
, (10) 


= Ae (2 u Te — u iietse) 


An (nz, - ) tna) me + Ti:Ai2 + TsA2s 
! (11) 


= An (2 u Tae — u raat) 
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nAsem, + (nz, = > tia) Ay + (nz, - ps >) Ass 


- p> Zia; a3 = ns — >> tee 
a 


nA = Aa(2 +S Lea ~~ : M tats), gj: = Ax Zz 2% 


gj: = An (x > Ta — , w taaine) 


C1 = Aged, a= nAzo, Cs = Ards 





After making the above substitutions in (10), (11) and (12) we obtain 
qm: + TiAn + TuAs = gi (14) 
cym, + T3Ai2 + T3Axg = Qs (15) 
CxM2 + Ay. + G3A23 = ge. (16) 


Solve (14), (15) and (16) simultaneously for me, A12, A23; the obtained 
solutions are th2 maximum likelihood estimators of m2, A12, Ass, namely 


fia, Ais, Aes respectively. 

(17) 

| 9 7; Tis gi T's | | 1 Ti gi | 

| Js Tis Ts Js Ts cs Tis J 
g2 % Gy g2 as Co Q g2 

la Ti Ths T, Tis aT; Tis 

‘ Ti3 Ts Tis Ts cs Tis Ts 

| 


Co Qi as | aq as Ce A, as | 


nm = ——————_- 

















oi2 (O23 ou oO | 
= Ax, 
ig «= 33 | O12 Og | 


— Ass 


hence we may write 
033012 — O13023 = — Ais 


13012 — Tud2 = Ag. 
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Substitute in (18) and (19) the maximuni likelihood estimators of A1s, 
Ass, O11, %33, 013 given in (17), (3), (4), (5) respectively and solve (18) 
and (19) for oi. and o23. The values so obtained are the maximum likeli- 
hood estimators of o1. and os, namely é and 63. 











|\— Aye 613 | S33 —Ay 
| doe On a A 
G12 = ——— , G23 = nih ositcmpmetit (20) 
| G33 Gis | 633 Og 
Gig Gn — | ds ou 








Substitute the maximum likelihood estimators of m, ms, me, Ais, A2s, 
O12, O23 given in (1), (2), (17) and (20) respectively in (2.24) and solve 
for the maximum likelihood estimator of oz, namely é22. In this result 
let 


T = >. (Tea o— mz)’, T,’ = + (x13 soa Z,)? 
a 8 


T;' = 2) (233 — %s)* 


8B 


, 21 
Dd (tis — #1) (a3 — Fs) (21) 


T 


II 








k, = mAo2(Arseis + Aas), ) 


1 on i oa a J 
622 = nA. [ky + A®eT, + AXTy’ + 2A2AaT is’ + An Ts’ }. (22) 
41 22 
We have now compieted the solution of the nine simultaneous equa- 
tions (17) to (29) of section two and obtained the maximum likelihood 
estimators of the parameters m1, me, Ms, O11, G22, %33, 12, 13, 23. 


4. THE REGRESSION EQUATION OF X;, ON X2 AND X; 


We desire to estimate for the sample given in (1, A) the regression 
coefficients 8; in the regression of X, on X» and X; holding the latter 
two variables constant. The regression equation is 


Xi = Bo + BX + 8;X3. (1) 





In standard units (1) becomes 


Ly = Be'x2 + By’ zs 
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where 
jie and 6; = 6:4/—- 
Ci Cu 
From (2) we obtain 
E(x%2) = Bs'E (a2) + Bs’ E (xs) (3) 
E(aits) = Bs’ E(x2t3) + Bs’E (25°). (4) 
Replace (3) and (4) by 
Bo’ + prsBs’ = pis (5) 
pf’ + Bs’ = pis. (6) 


The maximum likelihood estimators of 8,’ and B;’ are determined by 
replacing pis, pis aNd pos in (5) and (6) by their maximum likelihood esti- 
mators and solving the resulting equations for the 6”s (see Wilks [2] 
page 143) 


-~ - ~ 








a, Piz — Pisp23 , pis — pi2pes 
3,’ = ————, . _ B,! = ———— 
1 — pos” 1 — pos? 
where 
© 643 
" V 6 i653 


and é,;, 1, j,=1, 2, 3 have been obtained in section three. Finally we 
have 


a 


s=pig/— 632.8 

bs =B “Ss 

Bo = X; — hin — Xs. 
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ON LINEAR COMBINATIONS OF SEVERAL VARIANCES 


B. L. Weicn 
University of Leeds, England 


1. INTRODUCTION 


PROBLEM which occurs frequently in statistical practice is that of 
A estimating a quantity V, say, which is expressible linearly in 
terms of a number of basic variances 0”, a2? - - - 0”, i.e., a quantity 
which has the form 


V = yo}? + A2o2? see of Aron’, (1) 


where the \’s are known numbers. In several contexts where this prob- 
lem arises the data provide independent estimates s,;* of the o;? ({=1, 
2---+k), such that s,;*/o,* has the standard mean square distribution 
x.*/f; with f; degrees of freedom. Two types of situation may usefully 
be distinguished viz., (i) that where the \,; all have the same sign, 
which without loss of generality may be taken to be positive, and (it) 
that where some of the ; are positive and some negative. Statistical 
literature contains, perhaps, most references to (77), mainly concerned 
with the problem of estimating sc-called “components of variance.” 
Practically the whole of one issue of the journal Biometrics, for instance, 
was recently devoted to examples of this character (Vol. 7, No. 1, 
1951). The situation (7) on the other hand is not so often mentioned 
but is also of importance and, moreover, where it occurs it is often 
relatively easy to deal with. The chief examples of it, are those involv- 
ing the estimation of the gross variability of material: for which several 
individual sources of variability can be identified. 

In the present paper I shal} begin by describing what is probably 
the most familiar context in which both types of problem, distinguished 
above, may arise. This will suffice to indicate the main kind of diffi- 
culty which is encountered in practice in the attempt to obtain good 
estimates of quantities of the form V. Then I shall discuss briefly two 
large-sample approximations which have been proposed for calculating 
confidence limits for V. And finally I shall develop modifications of 
these approximations designed to be applicable to moderate-sized 
samples. : 

2. TWO COMPONENTS OF VARIATION 


It often happens that a set of data consists of mn, say, observations, 
constituted by first drawing m units at random frorn a large population 
of major sampling units and by then drawing n minor sampling units 
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at random from each of the selected major units. The major units are 
therefore to be regarded as sub-populations for the second stage of 
sampling. It will be assumed here that each of these sub-populations 
has the same large number of minor units in it and has the same vari- 
ability among the minor units. Let us denote the common population 
variance “within major units” by ¢,*. The variance between the means 
of the sub-populations, i.e., the variance “between major units” will be 
denoted by o. 

The information about ¢,,* and o»* contained in the sample may then 
be summarized in the standard analysis of variance form of Table 1. 
The notation here is self-explanatory. 











TABLE 1 
Quantity 
Source of s f Degrees of Mean estimated 
variation = ee freedom square by the 
mean square 
Between 


majorunits | S:=)on(zs.—2..)? | fi=(m—1) | 92=Sif7 | o:%*=noy?+ou? 





Within 
major units S:=> > (ami —20.)? f2=m(n—1) | 82? = S2f27 o2° =o" 

















We refer to the two mean squares as s,? and s,*. The expected 
values, o;” and o;”, of the mean squares are expressed in the last column 
of the table in terms of o,? and o,,*. Assuming normality s,2/0;? and 
&"/o9? are independently distributed in standard mean square forms 
so that the basic assumptions of our introduction are satisfied. 

(t) The estimation of gross variability. By the gross variability we 
mean the variability between minor units in the grand population ob- 
tained by throwing a!) the sub-populations together. The gross vari- 
ance, ¢,, is related to the individual components o,? and o,,? by the 
equation 

o,? = on? + ow’. (2) 


This is often the quantity of ultimate importance where variability is 
being studied. With manufactured articles, for instance, the over-all 
variability of the product may be the quantity specified. Estimates 
of components of variation due to different factors must, of course, be of 
some importance to the manufacturer wishing to meet the specification. 
He may find, however, that he has considerable latitude in the choice 
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of the levels at which to control the variability due to some at least of 
these individual factors, subject only to the over-all variability of the 
final product being kept sufficiently small. 

Given sample data, summarized in the form of Table 1, it is clear 
that in terms of o,? and o2* we have 


V =a,? = n-"'o;? + (n — 1)n"o??, (3) 


which is of the general form (1) with the }’s all positive. An unbiassed 
estimate of the gross variability is therefore given by 


v = 8,2 = n-'s,2 + (n — 1)n-"'s,”. (4) 


The sampling variation of v may be deduced by noting that we may 
write 


v= axi’fim' + aaxe*fe, (5) 
where 
a, = n-o,? = (no,? + ow?)n— \ (6) 
a, = (n — 1)n~o,* = (n — 1)n—"o,? 
and 
fi=(m-1); f2a=m(n—1). (7) 
In particular it follows that 
var (v) = 2(a;7f;-? + a,?f2-') (8) 
and that 
C. of Var. = y 2(a, + a2) (a,2f,-! + anf) 2. (9) 


This expression decreases from +/2f:-/? to +/2(fi+f2)7/? as ayaz-"/? 
increases from 0 to fifs-!, and then increases from +/2(fi+fe)-/? to 
/2f,-"? as a,a2~! increases from fif2-' to ©. Actually from (6) and (7) 
it may be seen that (a,a;-'—f,f.~") is always positive and will become 
large if oc. ! is large. The maximum possible value of the coefficient 
of variation is therefore »/2f;-*. Provided m is large enough, the esti- 
mate of o,? will be a good one, therefore, whatever the circumstances 
(at least in so far as it is agreed that the sampling coefficient of vari- 
ation of v is an appropriate criterion of goodness). It does not suffice, 
however, merely that mn should be large. The total sample size can be 
very large and yet in certain circumstances provide only a very poor 
estimate of o,?. 

(ti) The estimation of variance components. An estimate of ¢,’ is 
given directly by s*, and does not need further consideration. The 
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other component of variance, 07, is related to o;* and a? by the 
equation 


V =o,’ = n-!e,;? — n~¢,*, (10) 


which is of the general form (1) with the ’s not now all of the same sign. 
The unbiassed estimate of o? is given by 


v = 8,” = n—1s,2 — n-'!s,”, (11) 
This is again of the form (5), but now 


a= n-o,? = (no? oo “Fd 


1 


(12) 


a=- o> nox? 


As ovo! increases from 0 to ©, a2a;~' increases from —1 to 0 and, 
from (9), the coefficient of variation of » decreases from © to »/2f,-"?. 
In other words the coefficient of variation always exceeds »/2f,-”? 
and may exceed it very much if oo,,~' is small. Even if m is large it 
does not now automatically follow that the coefficient of variation of 
v will be small. 

For example, if m = 20 and n =2, and if the unknown o;? which we are 
estimating happens to be equal to }¢,”, then the coefficient of variation 
of the estimate, s,?, is found to be 1.161. This is worse than the result 
we could obtain if we had a sample of only three independent observa- 
tions from a single homogeneous population of quantities varying 
normally with standard deviation oy. 

Of course it might be argued that the coefficient of variation is not 
necessarily the best criterion with which to judge the accuracy of 
8%’, particularly when o;? is small. What the best criterion is must de- 
pend on the purpose for which the estimate of o;,? is required. If this 
purpose can be formulated precisely then some other ap,zoach might 
be more appropriate. Lacking this more detailed formulation, how- 
ever, we are forced to look at the estimation of o? as an object in itself; 
and we are merely following a common general procedure in problems 
of estimnation when we say that an estimate obtained in a given com- 
plex situation is equivalent to one obtained from a certain number of 
observations in the much simpler hypothetical situation of sampling 
from a homogeneous population. In the present case the use of the co- 
efficient of variation of v enables us directly to make such statements of 
equivalence. In the particular example quoted, although there are 
m =20 major units in the sample, the sample is, speaking roughly, only 
worth between two and three observations for the purpose of estimating 


on. 
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In situations involving more than two components of variation there 
will be several linear functions of the basic variances which may have 
practical importance. Some of these will have positive coefficients and 
some will have coefficients of both signs. In general the latter will lead to 
the more imprecise estimates. Hence, aithough in certain of the theoreti- 
cal derivations given below no distinctions are made depending on the 
signs of the \’s, the degree of usefulness of the theoretical expressions in 
practical applications does always depend on which of the two types 
of situation we are confronted with. 


3. NORMAL APPROXIMATION FOR LARGE DEGREES OF FREEDOM 
In general we have to consider 
V = X81? + AgS2? - - + AgSe?, (13) 


where s,?/¢,7 is distributed as x,?/f; with f; degrees of freedom (1=1, 
2.-.--k). The distribution of v follows by noting that v is of the form 
> aixef; where a;=),,2, but it cannot in general be reduced to any 
very simple expression. The mean of the distribution, however, is V 
and 


var (vl) = 2 > aff = 2 Do rAZosf. (14) 


Moreover when the f; are all large the ratio (v—V)/+/var(v) tends to 
the standard normal form. In other words, if £ is the standard normal 
deviate exceeded with probability (1—P), we have 


Pr[v — V) < tV(2 DAdvosf-)] = P. (15) 
More precisely it can be shown that 


Pr[wv — V) < tV(2 Dd AvoAf—)] = P + O(-”), (16) 


so that in saying that the probability of the inequality on the left- 
hand side of (15) is P we are committing an error of order f~"/?. 

Now since s,*/e;? is distributed about unity with variance 2/f; 
(t=1, 2 - - - k), we have, in the probability sense, 


Lastoifit = Di rteefe{l + Of) }. (17) 
Hence if >-),*s,4f;-" is substituted for >-d,*o,4f;— in (16) we shallstillhave - 
Pr[w@ — V) <tV(2 DY Assetf-)] = F + 0°"). (18) 

If, then, we make the assertion 


Pr{(v — V) < eV 2 Dd atefz-)] = P, (19) 
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our error in probability will stili be of order f-/? and will become negli- 
gible if the f; are large enough. Equation (19) is equivalent to saying 
that 


Pr[V >v —&/2 Dd dAvsif-)] = P (20) 


and by choosing appropriate values for P two-sided confidence limits 
for V can be obtained. For instance we may deduce 


Prlp—tV/(2 Desf) < V <ot+tV2D desf-)] = 2P -1. (21) 


This is the classical large samp'e method whereby limits of error on 
an estimate are obtained by setting off from it certain multiples of its 
estimated standard error. This method is so much a commonplace in 
statistical practice that statisticians are scarcely expected to justify 
it every time they make a particular application of it. And indeed 
this is, perhaps, fortunate since detailed discussion is apt to reveal 
marked differences of opinion even about the meaning of statements 
of the form (21). It may be advisable to stress, therefore, that in the 
present paper I understand the left-hand side of (21) to mean the prob- 
ability that a pair of variable limits will contain the fixed (but un- 
known) value V. The two limits in question are calculable from the 
sample values and will vary with the sample. The term “probability” in 
this connection has reference solely to the fact that the sample values 
are regarded as random variables. 

One minor point should possibly be mentioned here. The sampling 
variance of vis2 >-d,*e,4f;— and this has been estimated by 2>>A284f-1. 
It was pointed out by Daniels [5], however, that s,4 is not an unbiassed 
estimate of o,4 whereas f;s;4(f;+2)—! is. Hence an unbiassed estimate of 
2 >>A,20:f- is given by 2 >>d,?s4(f; +2). It might be argued that this 
latter is a more appropriate quantity to employ. It is clear, of course, 
that in large enough samples no difference would be made by using this 
revised estimate. If there were a large number of s,;? each based on very 
few degrees of freedom, however, it would matter which estimate was 
used, I am undecided what should be done as a general procedure but 
in the present paper shall adhere to the estimate 2 >>),’s,4f;-! on 
grounds of simplicity. 


4. ALTERNATIVE APPROXIMATION WHEN ); ARE POSITIVE 
_ When all the 2, are positive both v= >>d,s2 and V= odo," are 
essentially positive quantities. The normal approximation does not, 
however, take any account of these facts and, indeed, it is possible 
(even if extremely unlikely when the f; are large) for (20) to give a 








138 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


negative confidence limit for V. Such an objection cannot be lodged 
against an alternative method which is to approximate the distribution 
of v by a Pearson Type III curve with start at zero and with mean and 
stantard deviation equal to the true mean and standard deviation of v. 
This is equivalent to approximating the distribution of v/V by the 
standard mean square, or x?/F, distribution with 

F = (QD dw 2) Dd APo if). (22) 
I have provided in an earlier paper some calculations which show the 
usefulness of this approximation (Welch [9], Tables II and III). 
More exhaustive investigations have been carried out by F. E. Satter- 
thwaite [8] and G. E. P. Box [2]. My own limited results are quoted 
again in Table 2, below. They related only to the case k=2 with f,=4 
and f2=20. Here it was not difficult to deduce the exact distribution 
of v/V in terms of tabled functions. The distribution depends on 
population parameters only through the ratio y =),017(A,o17-+Agos")—*. 
For the values of y in column 1 of Table 2, below, and for the values of 
p in column 2, I obtained the exact values of Pr{v/V>p} shown in 
column 4. 

















TABLE 2 
Probability that vV-" exoeeds p 
don? . ™ 
y= —— p mae FS True | Approxima- 9 SS 4 
(you? +Azer2*) prob- tion using - eer 
ability | effective F hog 
1.000 2.222 4.00 0.064 0.064 0.042 
.973 2.151 4.22 .068 .068 .047 
.942 1.163 4.51 .321 .325 .403 
.926 1.487 4.66 .192 .195 .228 
.902 1.961 4.91 .083 .083 -066 
. 866 2.685 5.31 .019 .017 .003 
.808 3.846 6.07 .001 .001 .000 
.808 1.709 6.07 .112 .114 .108 
710 1.449 7.67 .166 .174 .189 
.621 1.212 9.64 .261 . 280 .321 




















The third column of Table 2 gives the effective numbers of degrees 
of freedom, F, calculated from equation (22) and the fifth column the 
approximation to Pr{v/V>p}, assuming v/V to have the standard 
mean square distribution with F degrees of freedom. The approxima- 
tion is obviously very good throughout. 
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The particular y and p instanced in Table 2 do not, of course, give 
as good a coverage of the possible values as would be desirable in a 
thorough investigation of the approximation. They just happen to be 
the values of y and p which arose out of the somewhat limited problem 
discussed in the paper I am quoting from. The very good agreement 
between columns 4 and 5 is partly due to fe being very much larger 
than fi, coupled with the fact that most of the y are near to unity. 
In these circumstances the distribution of v/V is mainly determined by 
its first constituent, which, apart from the multiplying constant, is 
exactly of the mean square form. However, this is a situation of fairly 
frequent occurrence in practice so that the comparisons in Table 2 are 
not irrelevant. 

In the. final column of Table 2, I have added the values of 
Pr{v/V>p} obtained by using the normal approximation of Section 3, 
above. The agreement with column 3 is not good. This is not sur- 
prising for the normal approximation demands that both f; and fz should 
be large. Since f; is only 4 and since the shape of the true distribution 
of v/V in the examples of Table 2 is determined mainly by the shape 
of the distribution of x,*/f:, which is far from normal for four degrees 
of freedom, we cannot expect good results. 

It remains to be said that in adapting approximate probability 
statements about v/V to provide approximate confidence limits for 
V, the limits finally obtained will depend on the F of (22), i.e., on the 
relative magnitudes of the o;. A further approximation is therefore 
involved in that F must be estimated by using the observed s;. How- 
ever, as in the case of the estimation of the standard error used in the 
normal spproximation, this procedure will not affect the accuracy of 
the contidence probability to order f°. 


5. ALTERNATIVE APPROXIMATION WHEN \; HAVE DIFFERING SIGNS 


When all the \; are not of the same sign the linear function Yaris? 
is not confined to positive values but can take all values from — © to 
+. In general, therefore, it would not seem that an improvement on 
the normal approximation to the distribution of v will be obtained by 
using a distribution confined only to a restricted range of the variable. 
Some qualification of this statement is, however, necessary. Often in 
practice, although some of the \; may be positive and some negative, 
the population quantity V= >>d,0?, which is being estimated, is es- 
sentially positive. This was so, for instance in Section 2(7), above, 
where the component of variance o,*=(0:7—o2*)n—! must be positive 
although its unbiassed estimate v= (s;2—s2?)n~' can be either positive 
or negative. Now in such a situation, although there is the chance that 
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v will turn out negative, we will not expect this chance to be large—~ 
or at least ii it is large then the data will be virtually useless for tae 
purpose of estimating o,*. In situations where a reasenably good es- 
timate of o,? is possible (i.e., an estimate with a small coefficicnt of 
variation), the chance of a negative v will be small and the treatment 
of the distribution of v as if it were a Type III distribution, confined 
to positive v, may be justified. But obviously some care is needed before 
this procedure is adopted. 

If the quantity V being estimated were not essenti..iy of given 
sign there would seem to be no grounds at all for approximating the 
distribution of v by a limited range type of curve. 


6. THE CALCULATION OF CORRECTIONS TO THE NORMAL APPROXIMATION 


The normal approximation described in section 3 demands that all 
the f; be large for its successful application. In certain circumstances 
where all the f; are not large and the normal approximation fails good 
results can, as we have seen, be obtained by approximating with a 
Pearson Type III curve, the reason being that often the.distribution 
of v=) \,8,2 is predominantly determined by one of the d,s;*. In 
general, however, care is necessary with this method. 

In the present section I wish to return to the normal approximation 
and consider whether it can be modified to be applicable to more 
moderate sized f;. The starting point will be equation (18) of section 3, 
namely, 


Pr{(v — V) < eV(2 De APsif)] = P + O(f-"*) (18 bis) 


and the question we shall ask is whether we can obtain a correction 
g(s*), say, which is a function of the sample values only and which has 
the property that 


Pr[w — V) <éV2 D Ataf) + g(s)] = P+ 0(f-), (23) 


where / is greater than }. If this can be done the application to the 
problem of calculating confidence limits is obvious. 

It should be mentioned that the particular case oi estimating an 
external component of variation has already received some discussion. 
M. S. Bartlett ({1] p. 312) has used this problem as an illustration of 
his general theory of calculating confidence limits using properties of 
the likelihood function. He gives a correction to the normal approxi- 
mation which, to the order of magnitude concerned, reduces to the one 
I give below. My own discussion here does not use any very general 
theory, but presents the problem in terms similar to that which I 
adopted in my discussion of the calculation of confidence limits for the 
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difference between two means (Welch [10]). This approach appears 
to lend itself more easily to the development of successive corrections. 

A further paper on the estimation of an external component of vari- 
ance has been published by J. R. Green [6]. This also has the object 
of developing successive approximations to confidence limits, but does 
not start from the basis of a normal approximation. Green’s method is 
in one sense more general in that it does not assume all the f; large. 
In the notation of Table 1, above, Green assumes that f2=m(n—1) is 
large but not necessarily f; =(m—1). 

Even closer to the procedure I am about to describe is that used by 
A. Huitson [7] who also reaches the result of my equation (42) below. 
The algebraic details of our derivations differ, but basically the methods 
are the same. 

Returning then to equations (18) and (23), the function g(s?) which 
is sought is a function of the observed s,* and is therefore a random 
variable. An easier problem than the solution of (23), and one which 
will be considered as a preliminary, is to find a function g(o*), de- 
pending on the population parameters rather than the sample statistics, 
with the property that 


Pr{(v — V) < EV2 Ss) + g(o*)] = P+ 007-9. (24) 


[Strictly speaking, for J general, the functional form g(c?) satisfying 
(24) will not be the same as that of g(s*) satisfying (23) and it would, 
perhaps, be as well to stress this fact by using different symbols for the 
two. However we shall not do this since for the particular I finally 
considered the same functional form does satisfy both equations. ] 

If we write 


u=v—-V—-i/2 > rds), (25) 
then equation (24) is equivalent to 
Pr[u < g(o*)] = P + O(f-). (26) 


Now by using straightforward Taylor expansions, either explicitly 
or in symbolic form (c.f., for instance, Welch [11] p. 331), we can ex- 
press the moment generating function of u(2p.)-/? in powers of f-/?. 
Hence the cumulants of this quantity can be shown to be 


mn = —&+ 0(f-) 

ke = 1 — 2y/2Epsopn-*/? + O(f-%) 
ks = 2y/2pspn*/? + 0(f-") 

ke = O(f-(/2) +1) (s > 3) 


; (27) 
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where I am now writing Darvoesfz = pa. These results are sufficient to 
enable us to use the formula of Cornish and Fisher [4] to approximate 
to any percentage point of u(2o2)~/*, the error being of order f~’. 
Applying this formula we obtain finally 


pk 2 
Pr [w < ate na) 


so that we have solved (26) with /=1. 
Now let us examine what happens if we replace the pw in (28) by 
ra» Where 


pu] =P +0, (28) 


ra = > d*829f-*. (29) 
Since in the probability sense s,;* differs from o,? by 0(j-"/), it is clear 
that rserz:~! will differ from ps.pn~! by 0(f-*/?). Since the standard devi- 
ation of u is O(f-"/*), the effect of replacing the p.» in (28) by ra will 
therefore be to modify the probability by 0(f-'). Hee also 


—2(2# +1 
Pr lw < ate rare] =P+0 (30) 


In solving (26) with /=1 we have therefore at the same time solved 
(23) with 1=1. It would not have been worth while to have kept extra 
terms in the expressions (27) with the object of deriving g(o*) more 
accurately to satisfy (26) with 1>1. For in the final stage, the substi- 
tution of g(o?) by g(s*) would still permit us tu satisfy (23) only with 
l=1. For higher order accuracy, however, we can carry out another 
cycle of the whole method. Thus if we write 
2(2¢? + 1) 


w= 6 orn (31) 


we can proceed to find a further correction ge(o*) such that 

Pri[w < g2(o*)] = P + 0(f-*/) (32) 
and examination of orders of magnitude will then show that we also 
have 

Pr[w < g2(s*)] = P + O(f-*”). (33) 
The algebraic details of this work are heavy and only the bare outline 


will be given here. We find for the cumulants of w(29.:)~"? the ex- 
pressions 


2(2¢* + 1 
4 V2Q8 +0 


- psp *!? + Enyspn~* — poop! + O(f-*/?) 


nw —F 
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8(2¢ + 1) 
3 

+ 4(2g? + 1)paspn~* + 2% papa? + O(f-*/) 
ks = 2/2pspn-¥!? — 18£papn~* + 6Eos*pn* + O(f-**) 
ke = 12pspn-* + O(f-*?) 
ke = O( f+) (s > 4). (34) 


ky = 1 — 2y/2kpsep-¥/* — *pa* 


From these it follows that the P-probability level of w(2px)—'/? is 
— Epxopar' + 3(5E* + GE) paso — $(168* + 23£)ps2*on-* + O(f-**). (35) 


Now the expression (35) is seen to be of order f-'. Alse, as noted above, 
ra» differs from pa by 0(f-”?). It follows from these two facts that we 


have also 
Pr[w(2ran)-!? < — Eregrar! + 3(5E? + 98)rera~? 
— $(16E* + 23€)rsern*] = P + O(f-*). (36) 


On substitution for w from (31) and (25), the inequality in the 
square bracket of (36) becomes 


V/2(2¢? + 1) 
3 
+ 3(5&* + 9é)rasram? — $(16E? + 23é)rae’re*. (37) 


The first term in the right-hand side of (37) is of order f°, being the 
large-sample normal approximation; the second term is a correction 
of order f~?; the next three terms combine to form the correction of 
order f-'. By continuation of the method we can add on further cor- 
rective terms which will make the probability in (36) become equal 
to P with error successively of order f-*, f-*/*, etc. The series so obtained 
does not converge in a strict sense. It will cease to be of value if 
is too large or f too small. But the inclusion of one or more of the 
groups of corrective terms will usually give much improvement over 
the large-sample normal approximation. The actual range of usefulness 
of the series can only be explored by numerical trial 


(v — V)(2ra)-¥? < & — Peon *!? — Freer 


7. AN ALTERNATIVE FORM 


If V is essentially positive and if the estimate v is either essentially 
positive or at least has a negligible chance of being negative it might 
seem more appropriate to throw (37) into a form involving the ratio 
v/V rather than the difference (v— V). This can easily be done for 
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Oe {1 ~<a -\" 7 {1 — ry o— ah (38) 
= Pr = TU (2ra) is 


The second term in the curly bracket of (38) is of order f-/?. Let us 
therefore expand by the binomial theorem including terms up to order 
f-**. In the resultant expansion substitute for (v—V)(2rx)-? the 
expression on the right hand side of (37). We then fud that, to the 
order concerned, (37) is equivalent to 








3 2 
vv 1+ V2E(ra%) — = (2&* + 1)rserav—! + 2Ere0-? 


— V2Erera V2 + 2-4 2(5E* + OE)reara*/2v-! 
V2 
“ee (16&* + 23£)rse*rn*/*0— 


4y 2 p* ; 
bare (25? + &)rsarn~/20-* 4 2/288 */20-*. (39) 


This form is rather more complicated than (37) and so it seems that 
nothing has been gained. It may be useful, however, to note that (39) 
provides a means of examining the accuracy of the method of approxi- 
mation described earlier in section 4. Here the critical value of »V— 
was taken to be the value M,, say, which is the P-probability level 
obtained from a table of the standard mean square distribution where 
the degrees of freedom, F, are defined by 


F-! = ( 2, d,?s8;4f 7-1) ( > Xgg?7)-? = ryv. (40) 
Now by straightforward application of the Cornish-Fisher formula it 
may be shown that 
2 
My = 1+ V2¢F-¥? + = (¢? — 1)F-' + ete. (41) 
and hence (39) gives 


2 
vV-! < Mp —- 3 (2? + 1) F—"(rsere1?v pap 1) + etc. (42) 


(The terms of order F-*/? could have been written down in (41) and 
(42) but the above will suffice to make our main point.) The approxi- 
mation in section 4 amounted to saying that Pr[vV-!<M,r]=P, and 
(42) now shows that the first corrective term to My depends on the 
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magnitude of F-1(rsr2-*v—1). We cannot say in general that this 
quantity must be small, but from numerical investigation it appears 
that it usually will be in the kind of situation where it is worth while 
estimating V at all. Unless the F calculated from (40) is, say, greater 
than ten, it does not seem to me that any estimate of V is of much prac- 
tical use. But if F is greater than ten we shall almost invariably find 
that F-'(rer2-*v — 1) is small and the use of the mean square distribu- 
tion to give confidence limits for V will not involve great errors. 


8. A PRACTICAL EXAMPLE 


Brownlee ({3] p. 117) has given some data relating to the percentage 
of a certain ingredient in a certain material. From each of six vats of 
the material two bags were taken. From each bag two samples were 
taken and on each sample duplicate analyses were made. Making the 
usual assuraptions about random and normal variation, let us denote 
the “true” between vat variation by o,, the between bag variation by 
o», the between sample variation by ¢, and the analytic variation by 
o,. Brownlee gives the following analysis of variance table: 























TABLE 3 
Degrees M Quantities 
Source of of Free-| Sum of | _©8? Estimated by 
Variation dom Squares Squares Mean Squares 
(fi) (%) (o¥2) 
Vats 5 66.25 13.250 | 8e,?+40;?+2¢,?2+0,? 
Bags within vats 6 36.25 6.042 4o;? +2¢,? +0, 
Samples within bags 12 45.50 3.792 2o0,2+0," 
Analyses within samples 24 20.00 0.833 oe" 
Total 47 168 .00 








From this table unbiassed estimates of the individual components of 
variance can be deduced. I have earlier in the paper expressed my 
skepticism as to the value of some of these estimates in themselves 
when obtained from small amounts of data. Only very imprecise 
estimates can be obtained. The situation may be somewhat better, 
however, if we are interested in a particular linear function of the 
components of variance for which there may be a reasonably good 
estimate. For instance the gross variance of the material, defined by 


gg” = (a," + oy? + o,”) (43) 
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may in fact be ultimately of most interest to us. (I am assuming o,? 
to be purely experimental error and not to contain a component 
ascribable to variability in the material.) It will in general be possible 
to find a better estimate of ¢,? than of the individual components. 
Whether even this estimate is satisfactory can however only be seen 
after further examination. 

Let us denote the four mean squares in Table 3 by s,*(¢=1, - - - 4) 
and the corresponding quantities which they estimate by o;*. (These 
are not the same quantities which Brownlee denotes by ¢;,?.) It will be 
convenient also to use V interchangeably for o,*. Then from (43) and 
Table 3 we see that 


1 
V= r (a3? + on? + 203? — 40,7) (44) 
which will be estimated by 
1 
v= 8 (8? + 8? + 283? sal 4s,”). (45) 


In Table 4 we set out systematically the further quantities which 
enter into our analysis. We have v= 2.943 and the 

















TABLE 4 
Ag2s,4 dgiss* 
Ne ase? -_— 
fi f? 
-125 1.656 .5485 .1817 
.125 -755 .0950 .0120 
. 250 -948 .0749 -0059 
— .500 — .416 .0072 — .0001 
v=2.943 ro = .7256 f32= .1995 














estimated standard error of v is (2rn)/?= 1.204. The coefficient of vari- 
ation of v is 41 per cent—somewhat large it is true. However speaking 
roughly (cf. section 2 above) this estimate is about as good as we would 
obtain in a situation where 13 independent observations are available 
from some homogeneous normal population with variance V. For some 
purposes this is regarded as useful. 

Suppose now that some particular confidence point for V be required 
—say a value such that the chance is only 0.05 of its being exceeded by 
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the true V. Let us proceed by using the approximate degrees of freedom 
method. We have from (40) and Table 4 


F-' = ryv-* = 0.08378; F = 11.937. (46) 


Now the lower five per cent point of the mean square distribution with 
F = 11.937, is Mr =.435. Also since 


Pr[vV-! < My] = Pr[V > Mr], (47) 


the required confidence point for V will be vM r~', which in the present 
case is 6.765. 

An indication of the order of magnitude of the error involved in the 
approximate use of the mean square distribution is obtained by a 
consideration of (42), where we must place §= — 1.6449. We then have 
with the aid of Table 4 


= @e + 1)F-(rera-%v — 1) = 0.041. (48) 


This is about 10 per cent of My. A more exact theory would not there- 
fore be likely to give a confidence point for differing by much more 
than 19 per cent from the figure 6.765 provided by the crude use of 
effective degrees of freedom. 


9. FURTHER DISCUSSION 


It is clear from the foregoing that the approximation which stems 
from the use of the Pearson Type ITI curve takes one a long way, and 
from the practical point of view this may be all that one wishes to know. 
It may be expected, however, that, by using the modifications de- 
scribed in equations (37) or (42) we should produce probability state- 
ments which may be validly applied for smaller f; than are the cruder 
earlier approximations. I have not investigated this in detail, but a 
fuller discussion of these modifications is given elsewhere by A. Huitson 
loc. cit. Huitson gives an independent derivation of the series (42) to 
terms in F-*/? and investigates some of its properties numerically. He 
has also calculated tables of confidence points of v/V applicable to the 
case of a linear function of two variances where the coefficients \, and 
dA: are both positive. 

In all this work it shouid, of course, be stressed that we are assuming 
the original variables with which we are dealing to be normally dis- 
tributed and hence that the variances s,* have the standard norinal 
theory distributions. One knows that departures from normality 
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have effects on the distributions of the s;? which are by no means 
negligible. Nevertheless it still appears to me that the normal theory 
should receive satisfactory treatment before other complicating fac- 
tors are considered. 


10. SUMMARY 


Well-known approximations to confidence points for linear func- 
tions of variances have been discussed. Modifications of these approxi- 
mations designed to give more exact results are developed in sections 
6 and 7. These permit the range of applicability of the earlier approxi- 
mations to be more close{v examined. 
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The analysis of complex factorial experiments is performed 
most efficiently and reliably by means of a tabular procedure 
suggested by Yates and described originally for the case of n 
factors each at two levels. In the present paper the method is 
generalized to factors at any number of levels, not necessarily 
the same for each factor. The calculations involved lend ther- 
selves to standard punch card equipment, and a fully pro- 
grammed example of an experiment with factors at 2, 3, and 4 
levels is described. 


1, INTRODUCTION 


N ANALYZING the results of a factorial experiment it is sometimes 

desirable to calculate separately a complete set of means and mean 
squares each relating to one degree of freedom. This complete analysis 
is most valuable in complex experimental designs which include con- 
founding, different error components, and possibilities of error varying 
with factor level or replication, or where it is desirable to examine and 
allow for the effect of some uncontrolled variable. Yates [7] has de- 
scribed a routine tabular method of calculating all main effects and 
interactions for experiments involving n factors each at two levels 
(“2* experiments”). This method, which includes independent checks 
for each step in the analysis, has been extended by Bainbridge to cover 
factors at more than two levels [1]. The case of a 3" experiment has 
been described by Davies [3] and some other examples by Eisenklam 
[4], but a general treatment has not been published. 

The tabular method has been widely used in the analysis of complex 
industrial experiments in Australia and in some cases the calculations 
have been performed by means of standard punch card equipment 
(e.g., Bull, [2]). It is the purpose of this paper to present a general | 
treatment of the tabular method for any number of factors at any 
number of levels, not necessarily the same for each factor, and to de- 
scribe some of the techniques used in the punch card calculations. 


2. ANALYSIS OF A FACTORIAL EXPERIMENT 


For a complete factorial experiment in which factor a is tested at J 
levels, factor b at K levels, etc., the means and mean squares corre- 
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sponding to single degrees of freedom are most simply calculated by 
means of N orthogonal sums of the form 
J-—1 K~1 L—1 


Y; = 2 Zz, > * * © ING jet. --LjRt->- 
j=0 k=) iO 

where N is the total number of observations and i ranges from 1 to 
N, where rj... represents the observed result with factor a at level 
j, factor b at level k, etc., and where the sets of coefficients m;,jx:... for 
different i are orthogonal. 

In practice J —1 of these sums will be chosen to cover the main effect 
“between levels of a” and will take the form 


J-1 

L m;X; 

j=0 
where X; is the sum of all results obtained with factor a at level j. 
The sets of coefficients m; will depend on the components of the main 
effect desired; for example if linear, 2nd degree, etc., components are 
desired and the levels of a are equally spaced, the coefficients for 
orthogonal polynomials given by Fisher and Yates [5] will be used. 
A further K—1 of the sums, of analogous form > *>} m.X,, will 
represent the main effect “between levels of 6,” and so on for all fac- 
tors. Components of the first order interaction between factors a and 
b, of which there are (J—1)(K—1), are derived from the sums 

j=} SE mym:.X p, where each set of m; values is used in turn with 
each set of m, values, and X, is the sum of all resulis obtained with 
factor a at level j and factor 6 at level k. Interactions between other 
pairs of factors and higher order interactions are obtained similarly. 
Finally, the means and mean squares for single degrees of freedom 
are given by Y;/ >.m? and Y;*/ }-m?, respectively, where > m? is the 
sum of squares of all coefficients appearing in the formula for the par- 
ticular Y;. 
3. GENERALIZED TABULAR METHOD 


The tabular method, which enables the routine calculation of all 
the sums required for the complete analysis of a factorial experiment, 
will now be described in general terms. 

The experimental results zm... are written in the first column of the 
table in the following order. For the first J entries all factors are at 
level 0, except for factor a which starts at level 0 and advances to 
level J—1. The next J entries are similar except that they relate to 
factor b at level 1, while the third set has b at level 2, and so on. The 
first JK entries thus cover systematically all combinations of levels 
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of factors a and b with all other factors at level 0. Results for higher 
levels of other factors, each introduced in turn, are then listed in a 
similar manner. 

For the purpose of computation, column 1 is divided into N/J 
groups each containing J entries. The sum of each group in turn is 
then entered in column 2, filling the first N/J lines. The next N/J 
entries are sums of the form > -4>} mijz,;, where zg is the (j+1) 
entry in the g® group, and m,,; takes on the values of m; corresponding 
to the first component, A;, of the main effect of factor a (e.g., the linear 
component). The column is then completed by applying each of the 
other sets of m; values in turn to the N/J groups of entries. 

In the second stage of the computation coluinn 2 is divided into 
N/K groups each containing K entries. These groups are then subjected 
to operations analogous to those performed on column 1, i.e., first we 
have the sum of each group, next sums of the form >=} mize, and 
so on. The process is repeated using sets of coefficients, m, appropriate 
to each of the remaining factors in turn. 

A check of each stage of the computation can be made by calculating 
subtotals of each column. For example for the first column — 
T;= DX“ 2; for each j. The sum Jr} (1+mij+maj+ - 
+myj.1,;)T; should then be equal to the total of the second » bs 
which can therefore be readily checked. 

The final sums, appearing in the (n+1)* column, where n is the num- 
ber of factors, will be the values of Y; defined in section 2. Particular 
components of the main effects and interactions can be identified from 
the order of the original entries in column 1. Thus opposite entry 7,4... 
will appear the interaction A;XB,XC,- +--+, referring to the order 
in which the sets of m;, m, etc. were applied. The first entry 
(j=k=l= --- =0) is the grand total of ail observations, and those for 
which only one subscript is non-zero represent the corresponding com- 
ponents of the main effects (e.g., 7=2, k=l= - -- =0 represents Ao, 
viz., the second component of the main effect of factor a). 


4. NUMERICAL EXAMPLE 


The generalized tabular method is illustrated in Table 1 for a hypo- 
thetical 3X24 factorial experiment. The coefficients used in each 
stage of the calculation are those appropriate to a polynomial analysis 
with equally spaced levels (i.e., those from Fisher & Yates [5] table 
of orthogonal polynomials) and are set out in Table 2 (i), (ii), and 
(iii). The coefficients used in calculating the check totals are given and 
also the divisors for each component. 

Referring now to Table 1, the observed results are given in column 1 
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TABLE 1 
ANALYSIS OF A 3X2xX4 FACTORIAL FXPERIMENT 
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(3) (4) Divisor Effect Square 





24 243 ,412.04 
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872 257 ,729.00 
700 
510 2(1)? =257 ,729 
666 





total i 2,748 
check 
total 2,632 














in the order previously described. Column 2 is formed by first obtain- 
ing the sum of each group (i.e., the operation using the coefficients 
mo,j); then, applying the coefficients m,,;, the difference (third —first) ; 
and finally the (first -2 Xsecond+third) in each group (i.e., using the 





FACTORIAL ANALYSIS AND USE OF PUNCH CARDS 153 
























































TABLE 2 
(i) FACTOR a; J =3 (ii) FACTOR b; K=2 
; | oe | divisor k 0 1. |divicr 

mo,j 1 1 1 4 ™Mo,k 1 1 2 

™\,; | -1 0 1 2 ™,k = 1 1 2 

m2,j i —2 6 

| check 9 2 
check | 1 -l 3 
(iii) FACTOR c; L=4 (iv) FACTOR c; L=4 
ORTHOGONAL POLYNOMIAL ALTERNATIVE ORTHOGONAL 
COEFFICIENTS COEFFICIENTS 
(cf. section 5) 

t | 0 1 2 8 | divisor I 0 1 2 8 | divisor 
mot 1 1 1 1 1 mot 1 1 1 1 4 
m,,1 -3 -1 1 3 20 m,: |—1 ee 2 
™2,1 1-1 —1l 1 4 ™me2,1 0 0-1 1 2 
™3,1 —1 3-3 1 20 m3: |-1 —1 1 1 4 
check |-2 2-2 6 check |—1 1 1 


























coefficients m2,;). The total of this column is seen to agree with the 
check total formed from the subtotals of column 1, viz., T>—71+3T2. 
Columns 3 and 4 are calculated in turn using the coefficients in (ii) 
and (iii) respectively, finally yielding the values Y; in column 4. The 
divisors in Table 1 are found most easily as products of those in Table 
2. Thus for the main effect A:, which is more completely represented 
by AiB,Co, the divisor is 2X 2X4=16, while for the interaction B,C; 
(i.e., AgBiCs) it is 3X2 20=120. The various main effects and inter- 
actions are identified as previously explained and the mean squares 
and means found from Y,*/divisor, and Y,/divisor, respectively. 
Finally the total of the mean squares is found to check with the sum of 
squares of the observations. 


5. MODIFICATION OF METHOD WHEN FACTORS ARE QUALITATIVE 


While the tabular method is most useful when it is desired to analyze 
the results of an experiment into single degrees of freedom, it may 
often prove convenient when the factors are qualitative and straight- 
forward, analysis into main effects and their various interactions is 
sufficient. In this case the mean squares for the components of each 
main effect or interaction are simply added to give the sum of squares 
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attributable to that effect. Very simple sets of orthogonal coefficients 
can usually be selected for this purpose, as no interpretation is to be 
placed on the individual components. For example with a factor at four 
levels (the term “levels” will be retained here for convenience although 
it is not quite accurate since we are here dealing with a qualitative 
factor) the coefficients given in Table 2 (iv), which are obviously much 
easier to apply than those given in 2 (iii), may be used. 

Compared with the more orthodox method of calculating the sums 
of squares from a number of two-way, three-way, etc., tables, the 
tabular method presents much less opportunity for copying errors, 
and also possesses vhe advantage that it is completely self-checking. 
However, if one or more of the factors occurs at a great number of 
“levels”, much time would be wasted in computing individual effects 
which are not required as such. In these cases one may use the follow- 
ing method which combines features of both the orthodox and tabular 
methods of analysis. 

Considering first an experiment in which only one factor occurs at a 
great number of “levels”, the observations are ordered as for the tabu- 
lar method with this factor as the last to be dealt with. The analysis 
is then carried out by the tabular method as far as the completion of 
the preceding stage. If we take the experiment in Table 1 as an exam- 
ple, this leaves us with column 3. The first four values are obviously 
the totals for each of the levels of factor c. Their sum (2417) is the grand 
total, with a contribution to the sum of squares of 2417°/24 or 1/6 of 
the “correction term” of the four values. The sum of squares attributa- 
ble to the effect C (with 3 degrees of freedom) can be found as 
(692°-+575? +460? +690") /6—2417?/24, or 1/6 of their reduced sum 
of squares. The divisor 6 is conveniently found as the product of those 
in Table 2 for A» and Bo. In a similar fashion 1/4 (from the divisors 
for A, and By) of the “correction term” of the next four values, i.e., 
1707/16, is the contribution of the main effect A; to the total sum of 
squares, while 1/4 of the reduced sum of squares of the four values 
gives the contribution of the interaction A,C. Thus each group of four 
values yields an effect (with 1 degree of freedom) independent of factor 
c, together with the interaction (with 3 degrees of freedom) between 
the corresponding components and factor ec. 

This modification may be extended to cover two or more factors. 
For example from column 2 of Table 1 we may make three two-way 
tables, each 2X4. Analyzirg the first table viz., 


330 271 211 329 
362 304 249 369 
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in the ordinary way will give four components, i.e., correction term, 
between rows, between columns, and interaction (the last being found 
by subtracting the others from the total sum of squares for the table). 
Divided by 3 (the divisor for Ao) they will give the sums of squares 
attributable to effects J, B, C, and BC, respectively. Components 
from the second table, this time divided by 2, will yield sums of squares 
due to A;, AiB, AiC, and A,BC, while the third table (with divisor 6) 
will give those corresponding to the effect A, and its interactions. 

By judicious use of these modificatiuns the tabular method may be 
employed with advantage in large experiments, avoiding both the 
confusion likely to arise in the orthodox method and most of the effort 
spent in calculating individual effects when these are not required. 


6, USE OF PUNCH CARDS 


Provided the numbers of levels of the various factors are not too 
large, the calculation by the tabular method mainly consists of repe- 
titions of simple operations. It may therefore be performed effectively 
by means of standard punch card equipment. In general terms the 
problem of the first stage of such a procedure is to operate on the 
observed results (arranged in N/J groups each containing J values) to 
produce the values in column 2 of the table, at the same time arranging 
them in N/K groups of K values each in preparation for the second 
stage of the calculation. These values are then operated on in a similar 
fashion to produce the entries in column 3 of the table arranged in 
N/L groups of L values each, and so on. The actual procedure, using 
an [.B.M. 421 Accounting Machine' and a summary punch, will now 
be described in detail, taking as an example the experiment given above 
with J=3, K=2, L=4, and coefficients given in Table 2 (i), (ii) and 
(iv). 

‘The observed results (in the standard order) are punched in groups 
of three (i.e., J) observations per card. Each of these cards also contains 
an indicator group h k p where h (=1) serves to identify the set of 
cards; k indicates the level of factor b and goes in general from 0 to 
K-—1 (i.e., 0 for the first, third . . . cards and 1 for the second, fourth, 
etc.), while p numbers the successive groups (in this case pairs) of cards. 

For the first stage of the calculation the first operation (I.1) consists 
of the simple summation of each field over all the cards of set 1, yield- 
ing the three subtotals 7’; required for checking purposes. In the second 
operation we have the following steps: 





1 With a slightly different nomenciature the same procedure is valid for the English Hollerith total- 
rolling tabulator. 
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1.2(i) the three values punched on one card are fed into counters 1 to 
3; 

1.2(ii) the contents of counters 1 and 2 are read into the positive of 
counter 3; 

1.2(iii) this result is passed by selection on k into one of the counters 
4 or 5, for k=0 or 1, respectively; 

I.2(iv) on a change in the value of p, i.e., after each pair of cards, the 
contents of counters 4 and 5 are punched into the first two 
fields of new cards (set 2) in the summary punch. 


The third and fourth operations differ from the second only with 
regard to step (ii). For the third operation this is replaced by I.3(ii): 
counter 1 is read into the negative of counter 3; while for the fourth it 
is replaced by 1.4(ii): counter 1 is read into the positive of counter 3 
and counter 2 into itself, followed by the reading of counter 2 into the 
negative of counter 3. 

At the conclusion of Stage I of the calculation, the first N/6 (in 
general N/JK) cards in the second set will each contain two entries 
resulting from the aadition of the groups of three observations in the 
first set of cards, the next N/6 cards will similarly contain pairs of 
entries representing the (third—first), ie., the sum of the form 

I=} mij, the last N/6 cards containing the remaining pairs of 
entries corresponding to }-=} m2,z,;. The cards must also be punched 
with the indicator group hlp where h=2 and | goes from 0 to 3, p 
numbering the successive groups of four cards each. 

The first operation of the second stage (II.1) yields the two sub- 
totals 7,. The total of these should be compared with the check total 
from the 7; values (7,>—7:+3T7:) before proceeding with the other 
operations of Stage II. The second and third operations ‘are similar to 
those in the first stage except that only counters i and 2 are required 
for the first and second steps, while the results (i.e., sums or differ- 
ences) are passed by selection on / into the counters 3, 4, 5, and 6 for 
l=0, 1, 2 and 3 respectively, and are punched into four fields of new 
cards (set 3) on a change in p, i.e., after each group of four cards. 

In the final stage the first operatic; produces the four subtotals 7',, 
whose total should agree with the check total from the T;, values (27). 
The second operation has the steps: 


III.2(i) the four values punched on one card are fed into counters 
1 to 4; 

III.2(ii) the contents of counters 1, 2 and 3 are read into the positive 
of counter 4, followed by the punching of the contents of 
counter 4 into new cards of set 4 in the summary punch. 
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For the third, fourth, and fifth operations (ii) is replaced by IXI.3(ii): 
counter 1 is read into the negative of counter 2, followed by the punch- 
ing of the contents of counter 2 into cards of set 4, using the same field 
as before; III.4(ii): counter 3 is read into the negative of counter 4, 
followed by the punching of the contents of counter 4 into cards of set 
4; III.5(ii): counter 1 is read into the positive of counter 2, and counter 
3 into the positive of counter 4. Counter 2 is then read into the negative 
of counter 4 and the contents of counter 4 are punched into cards of set 
4. 

Finally the cards of set 4 are punched with the appropriate divisors 
and the means and mean squares obtained using a multiplying punch. 
Alternatively the values may be listed and the means and mean 
squares found on a desk calculator. 


7. COMMENTS ON THE PUNCH CARD PROCEDURE 


The above procedure can be modified to give the analysis into other 
orthogonal components, e.g., the polynomial components of factor c 
calculated according to the coefficients in Table 2 (iii). Its extension 
to any number of factors is straightforward, merely introducing addi- 
tional stages into the calculation. However the analysis of higher-level 
designs may offer some difficulty not only because of the larger coeffi- 
cients, but also because it may require too many counters. The splitting 
of counters provides a way out only as long as the numerical values are 
small. However, as the number of counters required for any stage is the 
sum of the numbers of levels of two consecutive factors, this difficulty 
may be overcome in some cases by alternating high-level factors and 
those having fewer levels. 

In many experiments involving large numbers of “levels” the corre- 
sponding factors will be qualitative, and the complete analysis into 
single degrees of freedom will not be required. In these cases the tabular 
method (and punch card equipment) may be used for the lower level 
factors and the final stages carried out by the orthodox method. 

The use of punch cards in the tabular method is naturally most effi- 
cient when ali factors are tested at the same number of levels, since all 
stages of the calculation are then identical. However, many of the 
operations required for different numbers of levels are very similar to 
one another and rewiring of the machine may be restricted to the use 
of different fields of a vard or different counters, the more complicated 
control wiring being almost unchanged. Thus in the example described 
the first operation is the same for all stages of the calculation, and this 
is true in the main of the second operation also. The third operation 
in the case of both 2 and 3 level factors and the third and fourth opera- 
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tions for the 4 level factor are all essentially similar. Finally the remain- 
ing operations for the 3 and 4 level factors require only small differences 
in the control wiring. Because of these similarities, the decrease in 
efficiency due to the use of mixed level designs would not in general be 
a serious obstacle in the use of punch card techniques. 

The final comment concerns the simultaneous analysis by means of 
punch cards of different experiments based on the same experimental 
design. Initially the observed results for each experiment are ordered 
as before, then taken one experiment after another. The analysis can 
then be carried out exactly as before. However it should be noted that, 
if there are M experiments, the first M of the final sums obtained will 
represent the grand totals for the experiments, the next M values will 
be the sums corresponding to the A; effects, and so on. The order 
within each group of M values will be the same as the original order 
of the experiments. 

Previous utilizations of punch card techniques in connection with 
large factorial experiments [6] followed the orthodox method of analy- 
sis and thus required a complex set of sorting operations. By contrast 
the compact and repetitive operations of the tabular method seem 
particularly suited to the use of punch cards, and this must be counted 
among its major advantages. 
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the Abstracts Editor, Professor Walter L. Smith, Department of Sta- 
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Anis, A. A., “The variance of i..e maximum 
of partial sums of a fir:te number of inde- 
pendent normal variates,” Biometrika, 42 
(1955), 96-101. 


Suppose that from a random sample 
%,°***, 2% from a normal population of 
mean zero, the successive partial sums 2, 
D+2e, M+Mat+m, +++, Mt + +> +2, are 
calculated and the maximum U,, of these 
recorded. In an earlier paper (Anis and 
Lloyd, Biometrika 40: 35 (1953)) a very neat 
expression for the mean of U, was given. 
The present paper deals with the variance 
of U,. A numerical table is given for n325 
and a simple asymptotic formula for larger 
values of n. D. R. Cox, University of North 
Carolina. 


Bartlett, M. S. and Medhi, J., “On the ef 
ficiency of procedures for smoothing peri- 
odograms from time series with continuous 
spectra,” Biometrika, 42 (1955), 143-50. 


The paper is concerned with a stationary 
time series observed at discrete time instants 
and having no strictly periodic component. 
The second-moment properties of such a se- 
ries are determined equivalently by the se- 
rial covariances or by the spectral density 
function. The authors deal with the estima- 
tion of the spectral density function from a 
semple. 

It is well known that the sample analogue 
of the spectral density, the periodogram, 
has large sampling fluctuations which do 
not decrease as the sample size increases, so 
that some process of smoothing is called 
for. Various procedures for this have been 
suggested and they are compared in this pa- 
per. The optimum smoothing depends on 
the unknown population spectral density 
and on the criterion of efficiency adopted, 
and the authors say that many of their re- 
sults are not in a form for direct practical 
use. However they give numerical results 
from which some insight into the relative 
behavior of the different methods is ob- 
tained. D. R. Cox, University of North Caro- 
lina. 


Bartlett, M. S., “Approximate confidence 
intervals: III. A bias correction,” Bio- 
metrika, 42 (1955), 201-4. 


In an earlier paper (Biometrika 40: 306 
(1953]), the author gave a general method 
of eliminating nuisance parameters when 
constructing approximate confidence inter- 
vals based on . An error in this pa- 
per is corrected and several examples are 
discussed, including problems connected 
with time series. D. R. Cox, University of 
North Carolina. 


Broadbent, S. R., “Quantum hypotheses,” 
Biometrika, 42 (1955), 45-57. 


A histogram with fairly regular spacing of 
the modes suggests a population composed 
of subpopulations with equally spaced 
means; a process resulting in a series of con- 
stant percentage changes with time may be 
expected to yield such samples (after a loga- 
rithmic transformation). Such cases are the 
topic of this paper, which deals with aggre- 
gates of normal subpopulations in which the 
standard deviations are a linear function of 
the equally spaced means. 

The author gives a regression analysis 
leading to estimates of the model positions 
and standard deviations (assuming that the 
observations have been allocated to the cor- 
rect subpopulations) and presents an ex- 
ample dealing with Svedberg’s molecular- 
weight data. The concept of “lumped vari- 
ance,” the mean square of the distance, z, of 
an observation from the nearest mode (not 
necessarily its own subpopulation mode), is 
then introduced and applied to the estima- 
tion of variances when the modal positions 
are known, but the allocation of observa- 
tions is not. Computation of a consistent 
estimator is described, and auxiliary tables 
given. Finally, a test of whether there is any 
real grouping about the modes is made by 
testing the null hypothesis that z is rec- 
tangularly distributed; this test does not 
require normality of the subpopulations, 
and may be made with as few as 20 observa- 
tions. The author gives a table of signifi- 
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cance points for this test, and presents ex- 
amples of applications in biology, psychol- 
ogy, and physics. Ricnarp L. Carrer, Uni- 
versity of North Carolina. 


Bright, Harold F., “A method for computing 
the kendall tau coefficient,” 

and Psychological Measurement, 14 (1954), 
700—4. 


Presentation and discussion of a simpli- 
fied procedure, including a facilitating me- 
chanical device. Jutzan C. Stanizy, Uni- 
versity of Chicago. 


Hannan, E. J., “Exact tests for ser‘al corre- 
lation,” Biometrika, 42 (1955), 133-42. 


Consider a first-order Markov stationary 
time series generated by an equation of the 
type Xn=pXn1i+es. Ogawara (Annals of 
Mathematical Statistice 22: 115 [1951)) 
pointed out that an unbiased estimate of 
2(1+*)~ can be obtained from the condi- 
tional regression of the x on the 2:1, and 
that a simple “exact” test of significance can 
also be constructed. 

In the present paper it is shown that the 
asymptotic efficiency of Ogawara’s statistic, 
bh, tends to one as p tends to zero and that a 
more efficient statistic can be obtained by 
averaging b; with be, the regression coeffi- 
cient of 22; on 2. The methed is extended 
to give a test for serial correlation in the 
residuals from a regression equation. This is 
illustrated with a numerical example. D. R. 
Cox, University of North Carolina. 


Horst, Paul, “The estimation of immediate 
retest reliability,” Educational and Psycho- 
logical Measurement, 14 (1954), 705-8 


The Kuder-Richardson Formula 20 un- 
derestimates reliability if the variance of 
item difficulties is not zero and/or if not ali 
items in the test measure a single function. 
Horst proposes and proves the following 
formula for estimated retest reliability, 
which is expected to give a considerably 
higher figure than K-R 20 because it is not 
attenuated by dispersions of item difficul- 
ties or by item heterogeneity: r= 1—(n—M) 
/as*, where n is the number of items in the 
test, M is the mean of test scores, a is the 
number of choices (options) for any item 
(the same for all items), and o* is the vari- 
ance of test scores. Junttan C. STANLEY, 
University of Chicago. 
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Jowett, G. H., “Sampling properties of lo- 
cal statistics in i stochastic se- 
ries,” Biometrika, 42 (1955), 160-9. 

This is one of a series of papers stemming 
from the author's interest in problems of 
sampling and statistical analysis connected 
with industrial time series. The sampling 
moments in large samples of many statistics 
calculated from time series depend on the 
summation of long series of products of se- 
rial cumulants, or, for normal series, prod- 
ucts of autocorrelations. This is not only 


ee en ok hee hes 
these have sampling moments that can 
themselves be calculated from the short 
term behavior of the series. Several exam- 
ples are discussed, i 
reduced 


and a problem connected with spatial sys- 
tematic sampling. D. R. Cox, University of 
North Carolina. 


Page, E. S., “Control charts with warning 
lines,” Biometrika, 42 (1955), 243-257. 


Several different inspection schemes are 
compared, involving warning as well as ac- 
tion lines on the control chart. The author 
takes as a criterion the average run length 
(average number of articles inspected be- 
fore action is indicated, the true quality re- 


maining fixed). 

One specifies action if (1) n 
consecutive points (out of a sample of N) 
fall between the warning and action lines, or 
(2) any point falls outside the action lines. 


fied average run length 
normal population of known standard devi- 
ation). 

The author also develops a method jor 
controlling both the mean and the standard 
deviation by the use of one chart, for cases 
in which changes in the standard deviation 
are relatively rare or unimportant. In this 

, means of samples of N are plot- 
ted on the chart and rules as above are ap- 
plied, with the addition of (3) action if two 
out of any set of n consecutive points fall 
outside opposite warning lines. The average 
run length function is derived and tabu- 
lated to enable selection of the appropriate 
inspection scheme (for a normal popula- 
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tion), and numerical examples are given. 
Ricuarp L. Carrer, University of North 
Carolina. 


Rade, Lennart, “A note on a modified t 
test,” Skandinavisk Aktuarietidskrift (1954), 
65-70. 

The author considers the modified t-sta- 
tistic in which sample standard deviation is 
replaced by a mean of K ranges. An approx- 
imation to the distribution is proposed as- 
suming normality of the mean range. Values 
of percentage points of the statistic under 
the null hypothesis computed from the ap- 
proximation are compared with exact values 
given by Lord (Biometrika 34 (1947)). A 
simple approximate formula for the O.C 
curve of the two-sided test is given. Davip 
L. Watiacn, University of Chicago. 


Sampford, M. R., “The truncated negative 
binomial distribution,” Biometrika, 42 
(1955), 58-69. 

Suppose that an observed discrete fre- 
quency distribution is available, assumed to 
be derived by the random sampling of a neg- 
ative binomial distribution in which the ob- 
servations at zero have been suppressed. 
The paper deals with the estimation of the 
population parameters. 

Two methods are considered, a relatively 
quick one based on the first two moments 
and the maximum likelihood solution. The 
asymptotic efficiency of the first method is 
shown to be at least 80% except in very un- 
favorable cases. Numerical methods for 
solving the equations are described and il- 
lustrated by numerical examples. D. R. 
Cox, University of North Carolina. 


Tate, Robert F., “The theory of correlation 
between two continuous variables when one 
is dichotomized,” Biometrika, 42 (1955), 
205-16. 

The problem of biserial correlation arises 
when one is sampling from a bivariate nor- 
mal population in which one of the variables 
has been dichotomized, giving rise to only 
two observable values, say 0 and 1, and one 
wishes to use this dichotomized sample to 
estimate, or to test hypotheses concerning, 
the correlation coefficient p of the original 
bivariate normal distribution. The problem 
of bi-serial correlation occurs frequently in 
psychological work, especially in test con- 
struction and validation. 
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son distribution,” 
tidshrift (1954), 1-9. 


the correlated multivariate Poisson distri- 
bution, and the conditions and norming for 
which the latter tends to the multivariate 
normal. The conditions and norming for 
normality turn out to be the natural ones, 
in view of the fact that the marginal distri- 
butions are univariate Poisson. The author 
also gives a characterization, showing that 
the multivariate Poisson is factor closed, 
and compares it in several other respects 
with the multivariate normal. Hzersert T. 
Davin, University of Chicago. 


Thomas, George W., “Bounds for the ratio 
of range to standard deviation,” Bio- 
metrika, 42 (1955), 268-9. 


This note points out that the bounds for 
the ratio of range to standard deviation in 
the same sample are independent of the 
form of the population, and are determined 
by simple configurations of the sample 
points. A table of these bounds for various 
sample sizes is given, providing an immedi- 
ate check for gross errors in routine compu- 
tations. The author also gives a table of per- 
centage points of the distribution of this 
ratio for samples of size three from a normal 
population. Ricnarp L. Carrer, Univer- 
sity of North Carolina. 


Thompson, H. R., “Spatial point processes, 
with applications to ecology,” Biometrika, 
42 (1955), 102-15. 

The conventional methods for the analy- 
sis of a real sampling data give information 
about the over-all sampling distribution, 
but little or none regarding the pattern or 
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trend of values within the area sampled—a 
matter of importance in many fields, includ- 
ing ecology. This paper concerns the appli- 
cation of point process techniques to deriv- 
ing the probability relations between the 
numbers of plants occurring in contiguous 
rectangular quadrats of a sampling grid. 
The model assumes that seeds are ran- 
domly distributed over the area in the Pois- 
son distribution and that each seed then 
gives rise to an independent family of off- 
spring plants distributed around the parent 
seed according to the isotropic normal dis- 
tribution (the number of offspring per par- 
ent following an arbitrary probability law). 
Some special cases are worked out in detaii, 
and the author discusses application of the 
analysis of variance to successive nested ag- 
gregates of quadrats as a test for significant 
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patterns. Ricwarp L. Carrer, University of 
North Carolina. 


Wilk, M. B., “The randomization analysis 
of a generalized randomized block design,” 
Biometrika, 42 (1955), 70-9. 

A generalized randomized block design is 
one in which each treatment occurs the 
same number of times in each block. The 
author considers in detail the randomization 
properties of the conventional analysis of 
variance, when the treatment effects are not 
the same for all blocks or for all experimen- 
tal units. It is shown that certain compari- 
sons in the analysis of variance are biased 
when the treatment effects are not con- 
stant. D. R. Cox, University of North Caro- 
lina. 
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The Teaching of Statistics in Schools. A Report of the Council of the Royal Sta- 
tistical Society. 1951. Pp. 12. Printed separately 1 s. Reprinted in the Journal of 
the Royal Statistical Society, Series A, Vol. CXV, Part I, 1952. 


Heren M. Waker, Columbia University 


HE Council of the Royal Statistical Society here presents its considered 

recommendations concerning the teaching of statistics at the secondary 
school level. These recomendations are based on the report of the Teaching 
of Statistics Committee appointed by the Council, a distinguished committee 
composed of E. 8. Pearson, Chairman, R. G. D. Allen, B. C. Brookes, H. 
Campion, William Elderton, C. Oswald George, R. F. George, D. Heron, 
J. O. Irwin, E. C. Rhodes, L. H. C. Tippett, J. Wishart, and C. A. Moser, 
Honorary Secretary. This is the same committee which in 1947 produced a 
report on the Teaching of Statistics in Universities.! 

The main theme of this memorandum is the conviction that the statistical 
approach and acquaintance with certain elementary statistical techniques 
are of such value to the citizen of « democracy that they should be taught in 
the secondary school as a part of general education. Among the purposes 
which the Council believes such training should serve are the following: to 
encourage a habit of disciplined thinking about ordinary affairs in terms of 
quantities; to appraise figures critically, to appreciate their fallibility and 
limitations, and to consider tke effects of the errors with which figures meas- 
ure things; to examine criticaily and accept with reserve the conclusions 
drawn by other people from statistical data; to have some understanding of 
the more elementary statistical techniques which are part of the small change 
of public controversy and intercourse. Among the techniques and concepts 
which all citizens need to understand well enough to appreciate the message 
they convey and to avoid their gross misuse, the Council names these: 
charts, averages, rates, percentages, sampling, aggregates, probability and 
the frequency distribution. These concepts are not commonplace and special 
efforts will be required to make them so. Moreover all citizens need some 
understanding of such social and economic statistics as the cost-of-living 
index, or birth, death, and morbidity rates, in order to read the newspapers 
and to be conversant with public affairs. 

The memorandum raises the question as to whether it is preferable to 
establish separate courses in statistics in the secondary school or to introduce 
the statistical way of thinking into the general curriculum. Without deciding 
this issue, it states emphatically that “the chief good effect that statistics 
can have in the general education is in affecting the intellectual approach to 
other subjects and to life. Whatever the kind of provision made for teaching, 
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this ultimate effect should be kept to the fore as the main aim.” (Italics 
mine.) 

While written for the English schools, the argument in this memorandum 
has equal application to the American. Recognizing the competition of 
many important subjects for place in the curriculum, nevertheless I believe 
that the case for statistics as a subject of value in general education is too 
strong to be denied. There are however great practical difficulties to be over- 
come and perhaps England has come nearer to conquering those difficulties 
than we have. The report states that “It is for teachers and associated educa- 
tionists to deal with these details, and we are primarily concerned to estab- 
lish the case, so as to give teachers the urge to deal with the details of applica- 
tion”. 

One of the details to be considered is the selection of topics appropriate to 
the secondary school. The Council presents a list of topics almost all of 
which I know from experience can be taught in the high school. However, 
before statistics is likely to make much headway in the American secondary 
school some way must be found to convince teachers and admistrators that 
these subjects can be made meaningful to the high school pupil. 

A practical problem of great importance is the search for types of data 
which high school students can be encouraged to collect. The memorandum 
gives little help here. It makes one or two suggestions as to collecting data but 
sets up no general criteria. I have long believed that data appropriate for 
use by high school pupils must meet these criteria: 

(1) The data must have reality for the student (car loadings or stock 
market prices will not meet this criterion for most high school students). 

(2) The data must not appear trivial. (In one high school treatise stu- 
dents are asked to pluck blades of grass “at random” and make a fre- 
quency distribution of their lengths. This undertaking is not calculated to 
enhance respect for statistical method.) 

(3) The data must not embarrass the students. (Personal data on such 
matters as height and weight, spending money, or grades would be real and 
non-trivial, but might for some high school studerts be painfully embarrass- 
ing.) 

(4) Securing the data must not take so much time and effort that the 
students will lose interest in analyzing them. 

Finding data which meet these conditions may be one of the most difficult 
steps in facilitating the recommendations of the Council. 

The provision of suitably equipped teachers is recognized by the Council 
as the key to the teaching of statistics in the secondary school. They speak 
of the need for vacation or evening courses for teachers, given “by experienced 
school teachers and by lecturers in close touch with schoo! teaching. A 
course on statistics by a mere statistical expert, particularly if he has more 
mathematical statistics than imagination, in likely to do more harm than 
good”. 

The Council also recognizes the need for texts especially prepared for the 
secondary school and comments that this “is a field of activity primarily for 
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those experienced in teaching”. I should like to add my own opinion that the 
preparation of a good high school text will be a far more difficult business 
than the preparation of a good college text and will probably have to be 
based on considerable experimentation. 

The section of the report on school examinations giving an outline of the 
statistical content of the syllabuses of four examining authorities is of less 
immediate interest to Americans. 

All statisticians who are concerned either with the future of the profession 
or with the general statistical understanding of citizens should read this 
report and bring it to the atiention of persons working in the field of the high 
school curriculum or responsible for the training of secondary school teachers. 
There is much hard work ahead before the recommeidations of this Council 
can be even partially applied in the American high school and it will call for 
enthusiastic cooperation on the part not oily of school people but of statis- 
ticians who believe in the general education value of their subject. 


The Elements of Probability Theory and Some of Its Applications. Harald 
Cramér. New York: John Wiley and Sons; Stockholm: Almquist and Wiksell, 
1955. Pp. 281. $7.00. 


Lro Katz, Michigan State University 


To: new textbook is evidently intended for a full academic year’s work 
by junior or senior level students who have “some working knowledge 
of analytic geometry, calculus, and algebra, including determinants”. The 
background requirements, quoted from the preface, are almost verbatim 
those given in the author’s Mathematical Methods of Statistics; in this case, 
they are more realistic. 

There is a strong family resemblance, extending to typography and for- 
mat, between this book and the earlier Mathematical Methods (see reviews, 
this Journal, 1947, pp. 174-9). It seems almost inevitable that this will 
come to be known as the little Cramér. In spite of its (welcome) relatively 
diminutive size, the book covers the elements of the theory of statistics most 
adequately in all except two respects which the author has chosen to play 
down. 

First, in the first part, on foundations, the treatment of probability is 
quite concrete as well as compact (46 pages). It has very little of the abstract 
flavor of Feller’s treatment of the same topics, for example. Some, at least, 
of those who use this as a textbook, will wish to supplement and strengthen 
this presentation. It seems a little anomalous, especially in view of the title, 
that the probability theory presented should be so narrowly restricted to 
that necessary for the further developments in the next two parts. 

Second, developments in stat.stical theory of the past two decades are 
omitted in their entirety. The Neyman-Pearson notion, of choosing among 
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tests at fixed level of error of the first kind so as to minimize érrors of the 
second kind, is stated, but no attempt is made to employ likelihood ratios, 
for example, to construct optimal tests, nor is the Neyman-Pearson theory 
used to illuminate the difference between simple and composite hypotheses. 
Many potential users of the book will feel that students should be exposed to 
the viewpoints and principles of statistical decision theory even in their first 
“elements” course. 

Within these limitations set by the author, it is hard to see how a better 
book could have been written. Cramér’s clarity and conciseness of expression 
should serve as an ideal for writers of other textbooks. The many examples 
given illustrate and extend, in a masterful way, the theory presented. Excel- 
lent lists of problems are given at the end of each chapter (many more than 
in the earlier Mathematical Methods). The reviewer would urge most strongly 
that a very considerable portion of classroom time be given to discussion of 
these examples and problems. 

The organization of the book is almost identical with that of the author’s 
Mathematical Methods. The first part, on foundations (probability theory) 
has been discussed above. The second part (98 pages), on random variables 
and probability distributions, covers distributions in one dimension gen- 
erally, the binomial, Poisson, normal, x?, ¢t and F distributions in particu- 
lar, and variables and distributions in two and more dimensions. The third 
part (88 pages), on applications, starts with introductory remarks on the 
analogies between probability theory and statistical experience and discus- 
sion of the descriptive treatment of statistical material. It continues with 
sampling distributions and problems of statistical inference, namely, estima- 
tion, confidence interval construction and testing of hypotheses. The third 
part concludes with discussion of tests based on x* and a catch-all chapter of 
further applications, including theory of errors, some regression problems, 
analysis of variance, survey sampling and statistical quality control. 

A brief appendix gives and lists some properties of (complete) ’- and B- 
functions, and Stirling’s formula. A list of references for further reading is 
given, as are solutions for all problems. The tables include percentage points 
of the normal, x?, t and F distributions. The typography is excellent; proof- 
reading, a little less than perfect. 


Managerial Statistics. Kermit O. Hanson. New York: Prentice-Hall, Inc. 1955. 
Pp. xiv, 306. $4.75. 


CiaupeE 8S. Bringear, Union Oil Company of California 


ss statistics”, in the words of the author, “deals with data 
and methods which are useful to management executives in planning 
and controlling organization activities”. These data and methods are 
covered in twelve chapters devoted, in order, to these topics: (1) introduc- 
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tion, (2) sources of data, (3) collection of data by sampling, (4) frequency 
distributions and averages, (5) measures of variation and skewness, (6) 
analysis of general business conditions (including a brief section on index 
numbers), (7) time series analysis, (8) correlation analysis, (9) internal ad- 
ministrative control, (10) tables, (11) charts, and (12) statistics and mana- 
gerial control. 

The two chapters on charts and tables constitute about 25 per cent of the 
book and are excellent. This material is almost entirely drawn from actual 
business reports and consequently provides good illustrations of the variety 
of ways in which business data can be effectively presented (and sometimes 
distorted). Unfortunately, the balance of the statistical material does not 
maintain the standard of these two chapters. 

While the order of development is sound, the sections devoted to elemen- 
tary statistical methods are generally uneven and disappointing. Most of 
these sections seem tc resemble a sketchy summary of some standard text 
on business statistics, such as Croxton and Cowden’s Applied General 
Statistics. The reader inexperienced in statistics undoubtedly will be con- 
fused at times because of the author’s choice of words and notation; at other 
times, unfortunately, his confusion will result from a more fundamental 
reason. 

Two simple examples should illustrate the confusion caused by an occa- 
sional choice of the wrong words, or by the failure to place the proper qualifi- 
cation at the proper place: (1) In the discussion of the measures of central 
tendency, the author notes (p. 57) “In series of data which include some very 
high or very low values, the value of the mean will not be representative of 
the values which occur most frequently....In such instances the n.e- 
dian ... may be a more representutive figure”. This is followed three pages 
later by the correct statement “In a series of income data the mode is that 
income which occurs most frequently....” (2) After illustrating how to 
isolate cyclical variation in time series, he wrote (p. 136) “Crude attempts 
to remove irregular fluctuations (I) are sometimes made by computing mov- 
ing averages of adjusted sales (CI). Although this technique may be of 
academic interest, it holds little practical value for businessmen”. This is 
followed on the next page by [One of the steps in eliminating seasonal varia- 
tion is] “computation of a centered 12-month moving average which elimi- 
nates s-asonal and irregular forces. . . .” 

Notavion is also occasionally confusing. For example, although “M” is 
used for the mean and “s” for the sample standard deviation, “co,” is used 
for standard error of the mean of both the sample and the population. In 
Table 11, “M” stands for both the mean and the class interval midpoint. 

The other, more basic source of confusion is not so easily illustrated. It 
may be partly a matter of emphasis on what (to this reviewer, at least) are 
the fundamental notions of statistics that can be helpful to business execu- 
tives, and partly the result of an occasional misstatement (such as the 
“confidence limit” discussion on pp. 80-1). 
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Discussions of such basic subjects as the standard deviation, the normal 
distribution, and the correlation of time series all seem to reflect some vague 
confusion, possibly resulting from the author’s desire to keep things at a 
very elementary level. (On p. 94 he notes, for example, “Formulas and sym- 
bols are a form of shorthand which may be a convenience”.) About six pages 
are devoted in total to the standard deviation and the normal distribution 
and about 20 pages to the correlation of time series. The standard deviation 
discussion never gets to the fundamental importance of this measure nor, 
for that matter, does it provide a really good understanding of what the 
measure is. The section follows about six pages devoted to the range, the 
interquartile range, and the average deviation which, for practical matters, 
could be sharply compressed or even eliminated. The variance is never de- 
fined, although the term does appear at least once (p. 186). 

The norma! distribution is illustrated with an example of the bincmial 
(throwing dice} and followed by the comment (p. 77) that “Obviously, cis- 
tributions of business data which conform to the normal curve are relatively 
rare”. However, we are told (p. 79) that this raises no difficulties since “the 
‘errors,’ or differences of sample means and the true mean of a distribution 
fall in the approximate pattern of a normal curve even though the distribu- 
tion from which the samples were drawn was not a normal distribution” 
and that “an obvious illustration would be samples of incomes”. This re- 
viewer is afraid that this is not so obvious. 

The chapter on the correlation of time series for use in forecasting sales is 
handicapped by a tedious approach to the correlation coefficient (first, com- 
pute the regression equation, then the standard error of estimate, then 
the coefficient of nondetermination, then the coefficient of determination, 
and finally the coefficient of correlation). On establishing the significance of 
the final result the author comments only (p. 173) that “It is impossible to 
state arbitrarily how large a correlation coefficient must be to have signif- 
icance”. He illustrates the use of the standard error of estimate by determin- 
ing a range for the dependent variable estimated from a value of the in- 
dependent variable greater than any of the actual observations used in the 
original analysis; the concept of the standard error of forecast is not men- 
tioned. The problem of spurious correlation is considered briefly and then 
dismissed as not serious (p. 158) “in so far as sales forecasting is concerned” 
as long as a “logical connection exists” between the independent and depend- 
ent variables. 

All in all, the main difficulty of the book seems to be that it is not clearly 
directed at any single group. It is not, as the title seems to imply, a book to 
provide business executives with a knowledge of standard methods and data 
that will help them run their businesses. It comes closer to being a handbook 
for business administration students (the word “student” appears in the text 
several times) who will find themselves working for large corporations and 
who occasionally will be expected to manipulate some data to produce an 
average, a sales forecast, or maybe even a standard deviation. This reviewer 
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believes that if a book is directed at business executives it should have many 
actual illustrations of the use of all types of statistical techniques (not just the 
elementary ones) that can be profitably used in the business world. On the 
other hand, a book directed at business administration students should 
concentrate on getting across the fundamental notions soundly, with clear 
examples of their limitations. This book, for example, oddly enough omits 
any discussion of the histograr: and the use of semi-logarithmic chart paper, 
two techniques that are used widely in the analysis of business operations. 

In the introductory chapter, the author notes (p. 2) that “Excluded from 
this text . . . are those statistical techniques which have only limited or no 
practical application to management problems”. It would be unfortunate 
indeed if the world’s business executives became convinced that the methods 
of statistics could do no more for them than what is illustrated by this 
book. 


Statistical Methods in Educational and Psychological Research. James E. Wert, 
Charles O. Neidt, and J. Stanley Ahmann. New York: Appleton-Century-Crofts, 
Inc., 1954. Pp. vii, 435. $5.00. 


Lye V. Jongs, University of Chicago 


1TH the possible exception of two or three chapters, it is correct that 
Wei lack of college mathematics does not preclude use of this volume”, 
as is stated in the Preface. Designed for a two-semester course sequence, the 
contents include a comprehensive treatment of descriptive statistics, a short 
chapter on estimation, chapters on significance testing, including t, chi 
square, analysis of variance, analysis of covariance, chapters on linear and 
non-linear regression, and sections on discriminant analysis. Also included is 
a chapter on principles of psychological test construction. Each major topic 
is well-illustrated with examples primarily drawn from the field of education. 
In a text addressed to students who lack mathematical training, it would 
seem especially important for verbal discussion to exhibit precision and clar- 
ity. Such characteristics are not uniformly found in this book. There might 
be cited numerous examples of careless exposition or of inefficient organiza- 
tion of topics. Related to this is the apparent tendency to provide a greater 
volume of verbal discussion than seems necessary or desirable. The tendency 
is more marked in the first 130 pages, devoted primarily to descriptive statis- 
tics and to general comments regarding the nature of statistical inference. 
While m ost of the individual instances of carelessiiess are of relatively minor 
importance, the effect is at least one of distracting the reader. And, occasion- 
ally, misleading or ambiguous statements appear in the introduction of 
fundamental theory or in discussion sections critica! to the understanding 
of the student reader. 
In contrast to inadequacies in some sections, several chapters of the book 
exhibit succinct exposition of material extremely valuable to the student. 
Notable are the chapters on linear and non-linear regression, on serial corre- 
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lation, and the sections on two-sample discriminant analysis. In no other 
source are these topics presented so adequately for students of education 
and psychology. 

A serious shortcoming of the book is the rather haphazard treatment of 
assumptions underlying interpretation of statistical test procedures. The 
authors correctly allude to results of recent sampling experiments which 
have demonstrated some statistical sampling distributions (e.g., the F- 
distribution) to be less sensitive to violation of certain assumptions than was 
previously suspected. However, such findings do not justify failure to attend 
to assumptions. Even from a purely practical viewpoint, insufficient work 
has been reported on the effect upon derived distributions of given depar- 
tures from normality or homogeneity of variance. 

Particularly puzzling is the distinction drawn between alternative measures 
of association in a 2X2 contingency table. It is stated that interpretation of 
the phi coefficient (referred to in some sources as the four-fold point coeffi- 
cient, although actually applied to two-fold tables) demands an assumption 
that each dichotomized characteristic “is normally distributed although not 
more accurately evaluated” than by the specified dichotomy. With no fu;- 
ther explication, the reader is left to ponder the concept of normally dis- 
tributed point distributions. On the other hand, in discussing the coefficient 
of tetrachoric correlation, no mention is made of the need to assume under- 
lying normality of distributions. 

As a result of uneven attention to rigorous style, this text would seem a 
less likely candidate for a general course in introductory statistics than some 
other recent publications designed for students of education or psychology. 
Its outstanding feature is the unusually competent treatment of regression 
and discriminant analysis, a feature sufficient to deem it a valuable supple- 
ment to a student’s text-book library. 


Principles of Medical Statistics. Sixth Edition. A. Bradford Hiil. New York: Ox- 
ford University Press, 1955. Pp. ix, 314. $4.00. 


B. G. GreenserG, University of North Carolina 


= latest revision of one of the most popular texts of its type is disap- 
pointing. The major change consists of an additional chapter (viz. 
Chap. XX) in which additional problems and pitfalls in conducting clinical 
trials are highlighted. A good portion of this chapter was given in lectures 
by the author in this country in 1953. Minor changes include tables of ran- 
dom numbers (which the author himself constructed by drawings from other 
tables of random numbers) and the bringing up to date of the illustrative 
examples. 

The book was originally published in 1937 to meet the needs of research 
workers on medical problems. For this objective, the tone and level of the 
language were well chosen and the exposition was both lucid and logical. 
On the other hand, the development of new statistical tools plus the spread- 
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ing familiarity of old tools suggest that such standard techniques as Student’s 
t-test and the analysis of variance and covariance are not above the com- 
prehension of research workers in the medical sciences. These were under- 
standably omitted in the first edition but have been conspicuous by their 
absence in the later and present revisions. In lieu of introducing the t- 
distribution, the reader is repeatedly warned (e.g. p. 93, 117) that the 
author’s methods do not work when the sample is small. (How small?) 

The inclusion of Chapter XX is a welcome addition for it adopts a positive 
approach toward clinical field trials. In former editions, the author limited 
his advice to the reader about the possible mistakes in designing, conducting, 
and analyzing the clinical experiment. Although those materials on fallacies 
made excellent reading and provided good teaching examples, they con- 
stituted a negative attitude toward design by concentrating on what not to 
do. The new chapter changes this by starting at the beginning and advising 
how to proceed, step-wise, with cogent reasons provided for each step in 
the process. 

The reviewer feels that this book is still unchallenged as the best text in 
its field. The criticisms that are offered here, however, reflect the hope that 
future revisions, even if they do increase the size of the book, will help to 
make it a more perfect text. 

Chapter I is concerned with justification for the statistical method as one 
of the tools of research in clinical medicine as well as in public health. The 
second chapter deals with the problems of selecting a representative sample 
from the target population. The author illustrates clearly several kinds of 
biases that can unknowingly creep in during the selection of a sample by 
using hospitai statistics, volunteers, questionnaires, or even in clinical trials 
where the day on which treatment was started is considered to have been a 
random selection with respect to treatment. 

Chapter III discusses the tabular and graphical presentation of statistical 
data. The material on graphs is good but rather scanty omitting completely 
any reference to semi-logarithmic graphs, bar charts, or pictorial statistics. 

The next chapter on measurement of central tendency deals with calcu- 
lating the arithmetic mean, median and mode in grouped and ungrouped 
data. In this connection, the author might have explained why the median 
is the value of the (n+1)/2 observation in the latter but the n/2 observation 
in the former. The omission of other averages commonly encountered in 
clinical and public health work is also noted. The use of logarithms to de- 
termine an average in bacterial counts, for instance, is so well known that a 
word about the geometric mean might have been in order. Also, the mid- 
range as an average in small samples from a normal distribution has suffi- 
cient efficiency to warrant its inclusion in the text. 

In Chapter V, the variation of observaticns around the central tendency 
is considered and the usual measures of scatter discussed. The meaning and 
use of the standard deviation in the normal curve at this early stage is a 
good teaching technique but the relationship between the normal curve and 
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the frequency distribution of stature is a point where the author might be 
more discriminating. The author knows full well that stature only resembles 
the shape of a normal curve for practical purposes and the normal curve has 
a probability origin quite apart from the frequency distribution of a physio- 
logical attribute. The text implies that if one measured stature in a large 
enough sample, the normal curve would result. 

The succeeding chapter is concerned with calculation of the standard de- 
viation by various methods and the correction for bias when estimating its 
value from a sample. 

Chapter VII is devoted to the problems of estimating an average from a 
sample. The exposition is splendid here and the only criticism is the one 
mentioned above, viz. the omission of Student’s major contribution for 
small samples. 

Starting with Chapter VIII, the text considers the analysis of qualitative 
attributes or data measured on a nominal scale. Early introduction of data 
of this kind is an excellent idea because of its frequent occurrence in health 
statistics. The first portion of the succeeding chapter deais with the problem 
of measuring and testing differences between proportions. The latter por- 
tion of Chapter IX considers the problerm of the difference between two 
means. Here the author states that to test such differences, one can calcu- 
late either a pooled standard deviation for the standard error of the dif- 
ference between means or alternatively base the latter value upon the in- 
dividual standard errors of the two respective means. The text is misleading 
by inferring from a particular example that “the two methods give closely 
the same resu/ts” and implying that this condition is true in general. To make 
matters worse, the author recommends the latter procedure when the two 
variabilities are unequal without any mention of necessary adjustment for 
levels of significance and testing criterion. If the samples are of different sizes 
and are drawn from two universes which differ radically i: their variability, 
the two methods do not even yield closely the same resi ts. 

The use of contingency x? is introduced in Chapter X without relating it 
specifically to the previous chapter for testing the difference between two 
proportions. The chapter starts off with a 2X4 contingency table and pre- 
sents the use of x* in a very understanding and logical fashion. The succeed- 
ing chapter is a continuation of x* with applications to contingency tables 
when the estimates of expected numbers are not necessarily based upon the 
marginal totals. The discussion of association in conjunction with the in- 
terpretation of a significant value of x? is well stated and one of the better 
features of the text. Special calculating formulas for the fourfold contingency 
table and Yates’ correction are also included in this chapter. 

The coefficient of correlation is considered in Chapter XII. Here, the 
least squares estimate of the slope of the regression equation of y on 2 is 
confused with the notion of correlation. This is one place where the text 
should specifically state that the formula proposed for the standard error 
of the correlation coefficient is for large samples only. Details for calculating 
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the coefficient of correlation are worked out in the succeeding chapter for 
ungrouped and grouped data. 

Chapter XV digresses from the usual statistical methods and concentrates 
on the construction of life tables with emphasis on their applicability to the 
problems of measuring survival after treatment such as for cancer. The 
treatment of this subject is well done for the short space devoted to it. 

The following three chapters are devoted to common fallacies and difficul- 
ties encountered in the misuse of statistics. The author is in his glory here for 
these pages are filled with jewels particularly useful for teaching purposes. 
In fact, two mistakes being made currently (November 1955) by advocates 
of poliomyelitis vaccine inoculations would have been averted if the statis- 
ticians (sic) had familiarized themselves with Bradford Hill’s hypothetical 
example on pages 180-183 entitled “Neglect of the Period of Exposure to 
Risk.” 

Age-adjusted or standardized death rates are discussed in detail in Chapter 
XVITi because one of the illustrations of the misuse of statistics in a pre- 
vious chapter points up the defect of comparing crude death rates of differ- 
ing population groups. Mortality and morbidity ratios are discussed in the 
next chapter. 

The final chapter is a general surnmary with final words of warning about 
significance and reasonable inferences. 

A valuable part of the text is the glossary of terms defined in the appendix. 
There are also exercises with answers for each appropriate chs ter of the 
book. 


Graphical Method of Statistical Inference, Seminar Note. Motosaburo Masu. 
yama. Tokyo: Maruzen Company, Ltd., 1954. Pp. ii, 83. Paper. $1.50. 


R. N. Brapt, University of Kansas 


= note on graphical methods in statistical inference is devoted to the 
uses of double square root, or binomial probability paper. We are pre- 
sented with an essentialiy new approach to the uses of double square root 
paper, one which includes in its scope the previous uses and, in addition, 
permits graphical treatment of the non-central distributions. 

The heart of the note is the third chapter; following it are four chapters of 
exercises and an appendix of answers and hints. The first chapter, “Intro- 
duction to the Logic of Statistical Inference”, is an exceedingly brief (26 pp.) 
survey of standard statistical techniques with references to fuller textbook 
treatments. If its faults were corollaries of its brevity, no comment would be 
in order; however, without the ballast of some knowledge in the subject, a 
reader might easily be led astray by such statements as “The positive 
square root of F with (1, f) degrees of freedom is called ‘Student’s’ t with f 
degrees of freedom”, or by such a discussion as is found in section 1.18, 
where we pass from the expression for the probability of at least k successes 
in N Bernoulli trials in terms of the incomplete Beta-function to the (un- 





174 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


stated) conclusion that p has a Beta distribution and to the (stated) con- 
clusion that (VY —k+1)p/kq has an F distribution with (2k, 2(N —k+1)) de- 
grees of freedom. 

In addition to introducing the standard distributions and the basic ideas of 
estimation and testing hypotheses, the following diagram of the relations 
among the standard distributions is presented: 


tv’ ip F’ _— (x’)? 
Binomial = Beta = F > x? = Gamma = Poisson 
t Normal Exponential 


where ’ indicates the corresponding non-central distribution, a single arrow 
indicates inclusion as # special case, and a double arrow indicates the equiv- 
alence of the distributions. Taking the descriptions “equivalent” and 
“special case of” with a certain amount of salt, i.e., un7srstanding of the 
various senses in which they are used, this scherme provides a frame- 
work within which we can see the author’s purpose and accomplishment. 

The rationale behind the use of double square rovt paper for graphical re- 
presentation of various distributions has, up to nw, been to enter this 
diagram at the left end of the middle line througk th» arcsine transformation 
of the binomial distribution. Chapter II is devoted to a short discussion of 
this approach and of some of the problems which can be handled graphically 
thereby. The paper by Mosteller and Tukey, “On the Uses and Usefulness 
of Binomial Probability Paper”, in this journal (June, 1949), which is re- 
ferred to, attacks the problem via this route and provides a more complete 
discussion of this approach than the paper under review. Indeed, here is 
undoubtedly the genesis of the name ‘Binomial Probability Paper’ which 
Mosteller and Tukey chose. Masuyama has designed his own version and 
called it Stochastic Paper. They are not fundamentally different; the Bino- 
mial paper has a unit of 1.016 em. so that a 95 per cent confidence interval 
corresponds to +1 cm., whereas the Stochastic Paper has a unit of 1 em. and 
in addition has in the margins a scale for converting lengths to (one-sided) 
percentage points and two brief tables of | per cent and 5 per cent points, 
one for the distribution of R/o and one for x*/n. These tables, together with 
a choice of unit which makes it possible to extract roots with the aid of a 
ruler, would seem to give the Stochastic Paper a slight edge. 

The approach through the aresine transformation of a binomial variate 
does not, however, appear to lead to any graphical method for dealing with 
the distributions on the top line of the diagram, the non-central distributions. 
In Chapter III Masuyama develops a method for handling these distributions 
graphically. 

The first step is to show that if X, and X, are independently distributed 
as N(mi,o7), respectively, and Pr(X:<0) is small, then the distance (in 
centimeters) from the plotted point 


(25) (2) ) 
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to the random “split” (X1/X2)*; (o1/¢2)? has, approximately, a N(9,}) dis- 
tribution. Although the conclusion, as stated, is true, the proof presented 
requires that Pr(X:<0) be small as well. Then making use of the known 
normal apprcximations to the central and non-central chi distributions for 
large degrees of freedom, the distance from the point 


(1 +2Yh4 


to the random split f:F’ :f. has, approximately, a N(0,}) distribution where 
F’ is a non-central F variate with (f:, f2) degrees of freedom and eccentricity 
u (which is assumed to be small relative to f;). 

Now that we are firmly (though approximately) established in the graph- 
ing business for non-central F distributions, we are in business with all the 
other distributions listed in the diagram, since they are all “special cases”. 
The remainder of the chapter is concerned with making some of these 
specializations, illustrating their use in estimation, testing, and design, and 
with some remarks relative to the question of the accuracy of the approxima- 
tions, which appears to be satisfactory, at least for cases in which one con- 
siders using graphical methods to begin with. Because of the difficulties in 
obtaining tables of these distributions, approximations are almost a necessity 
in any case and it would seem that these graphical methods should be a use- 
ful addition to the statisticians’ tools. One reservation is that in certain of the 
applications it would seem that in order to plot the required points one would 
have to carry out a large part of the computations which would be needed 
in order to carry through a full numerical treatment, thus dissipating 
much of the advantage. 

There are a great many errors in the text, many of which are corrected on 
an accompanying errata sheet, though a good many still remain. The syntax 
is quite uneven. The drawings appear to have been made without benefit 
of drafting equipment. 


Tables of the Cumulative Binc mial Probability Distribution. Staff of the Compu- 
tation Laboratory. Cambridge, Massachusetts: Harvard University Press, 1955. 
Pp. lxi, 503. $8.00. 


f poe table carries the binomial distribution to far higher values of n than 
any of its predecessors, the values covered being 1(1)50(2)100(10) 
200(20)500(50)1000. In addition to the customary values of p by steps of 
0.01 to 0.50, a welcome feature is the provision of 10 additional common frac- 
tions, ys, Py, 4, 4, ve, Ys, 4, 8, Ay, and yy. Cumulative probabilities are shown 
to 5 decimals (not 6, as stated in the publisher’s advertisement in last 
September’s issue of this Journal). 

Except for the 10 extra values of p, values of n up to 150 are not covered 
as well here as in the Ordnance Tables, described in this Journal, Vol. 49 
(1954), p. 667. Thus, more than a third of the space is devoted to duplicat- 
ing (but to fewer decimals) results already available, but the duplication is 
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not complete enough to make this single set of tables sufficient. This volume 
and the Ordnance volume together, however, provide everything available 
in published tabulations of the cumulative binomial. 

The Tables are divided into 6 sections according to values of p, a some- 
what less convenient division for most statistical purposes than according 
to values of n. 

Perhaps the most valuable feature of the Tables is a 28-page introductory 
section on applications of the binomial distribution, by Frederick Mosteller. 
Seventeen well-selected examples are presented clearly. In fact, this is prob- 
ably the best and most comprehensive treatment of statistical applications 
of the binomial distribution available in the literature. 

An Appendix gives values of log» n! to 10 decimals for n = 1(1)1199. 

W.A.W 


A New Method of Analyzing Extreme-Value Data, NACA Technical Note 3053. 
Julius Lieblein. Washington, D. C.: National Advisory Committee for Aeronau- 
tics, 1954. Pp. 88. Paper. 


Braprorp F. Krupa, N. Y. Public Service Commission 


aes report describes a method of estimation of the parameters of the 
extreme-value distribution (doubly exponential) which constitutes an 
important step forward in the methodology of the estimation of parameters 
in curve fitting. The method follows the spirit of Mosteller’s suggestions 
concerning the use of “inefficient” statistics. The value of the report trans- 
cends its immediate application to the theory of extreme values in that a 
methodology is developed which may well point the way to a more practical 
approach to the estimation of significant parameters in other types of distri- 
butions. 

The first step is to devise an estimator of the parameter in question which 
is a linear function of “order” statistics such that 

a. The bias of such estimator is zero. 

b. The sampling variance of the estimator is a known function of the 

variances and covariances of the order statistics. 
The next step is to choose the coefficients of the order statistics so that the 
sampling variance of the estimator is minimized. . 

It turns out that this method is particularly felicitous as applied to the 
extreme-value distribution. The cumulative distribution function F(z) is 
given by 

P = F(z) =exp(-e), y= (z-—u)/B 4) 


where u and 6 are the parameters of the distribution and y is the intrinsic 
variable, sometimes called the “reduced variate”. From (1) it is clear that 
y can be determined by the value P of F(z). Denoting this relation by yp one 
can write 


Zp = & + Byp. (2) 
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Hence if parameters u and 8 are known the value of z is a linear function of 
yp. In using the extreme-value distribution, estimation of z for large values 
of P such as .90, .95, .99, etc., is particularly desirable. Lieblein chooses his 
primary parameter ¢ as 


gp = u + Byp (3) 
with P and hence yp given. He seeks to determine weights w; of the linear 
estimator 

L = > wX; (4) 
of the ordered sample values X; such that 

E(L) = tp (5) 


and that the variance of L be minimized. In order to solve this problem it 
is necessary to know the variances and covariances of the ordered sample 
statistics X;. These were investigated and a published paper’ indicates a 
method for their determination. 

In order t» gauge the efficiency of the estimator its variance is compared 
with the Cramér-Rao lower bound. Solutions are carried up to n=6, and a 
method of grouping larger samples so that they can be handled by means of 
the n=5 snd n=6 solutions is introduced. Lieblein remarks: “In view of 
the satisfactory values of efficiency, further calculation for n>6 did not 
appear warranted at this time”. 

A remarkable feature of the method is that the solution of the equations 
which determine the optimum weights w; (see (4) above) are of the form 


wz = a; + diye, i-=1,2,--++,n (6) 


where a; and °; depend only upon the sample size n. It can be shown to 
follow that this determination furnishes estimates of the original parameters 


t@=Lak, B= LX (7) 


with computable variances that indicate high efficiency. 

It should be noted that the above method in its entirety will apply to any 
estimation. pr*>lem where the cumulative distribution function is given 
as a known function F(y) of an intermediate variable y which in turn is a 
linear function of the observed variable z, involving coefficients which are 
the unknown parameters. 

The monograph is weil put together and is indispensable to those apply- 
ing the theory of extreme-values based on the doubly exponential distribu- 
tion. Lieblein has been quick to appreciate the possibilities of the order 
statistics’ approach and has done a thorough job. 

One might observe that in using order statistics one adds information 
which is very easy to obtain. The ordering of data is in general a simple 
process. Modern theory has been able to use this additional information. 
Lieblein’s monograph is an example of the added power of this approach. 





i Lieblein, Julius, “On the exact evaluation of the variances and covariances of order statistics...” 
Annals of Mathematical Statistics, 24 (1953), 282-7. 
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Measurement of Industrial Productivity. M. M. Mehta, Calcutta: The World 
Press Limited, 1955. Pp. ii, 98. 10s. 6d. net. 


Joun W. Kenpnricx, National Bureau of Economic Research 


penne since World War II, productivity is being defined and 
measured for the purpose of temporal and spatial comparisons as the 
ratio of output to the aggregate of corresponding inputs, both expressed in 
real terms. Despite the serious problems involved in using this concept, it 
has been considered superior to the usual “output per man-hour” or other 
“partial productivity” indexes as a measure of changes (or differences) in 
productive efficiency, since the partial measures are also influenced by factor 
substitution. Yet prior to 1946, practically all of the “productivity” indexes 
prepared in this country were output per man-hour measures. This was due 
not only to statistical expediency or inadequate theoretical clarification of 
the productivity concept, but also to concern with labor-saving as such, 
for which the labor input-output ratio is appropriate. 

The keynote of the movement towards measurement of “total productiv- 
ity” was struck by Hiram Davis at the first National Conference on Produc- 
tivity in 1946, and subsequently elaborated by him in two books concerned 
chiefly with the measurement of productivity in the firm. Attempts to meas- 
ure total productivity at the industry and national ievels in this country 
have been made by Stigler, Schmookler, Barton and Cooper, Fabricant, 
the present writer, and possibly others, within the past decade. 

The work by Mehta, written while he was associated with Davis at the 
University of Pennsylvania in 1952-53, is a selective, critical review of the 
literature on the concept, measurement, and interpretation of productivity 
comparisons, written from the “total productivity” viewpoint. Unfortu- 
nately, the book is poorly edited: frequently the English constructions leave 
much to be desired, and occasionally footnote references are lacking. But 
the author’s meaning usually comes through plainly. While it does not push 
forward the frontiers of thought, and handles some important matters in a 
cursory way, the book will be of use to the economist who seeks to acquaint 
himself with the growing body of productivity literature. . 

Since Mehta presents convincingly the case for relating output to total 
inputs, it is disappointing that he does not examine more closely the prob- 
lems of measuring the non-labor inputs—although he goes over some of the 
familiar problems involved in the estimation of output and of labor input. 
With regard to capital, he rejects ‘or sufficient reasons the attempt to trans- 
late capital services into man-hours; but then he also calls “untenable” 
the use of “plant-hours” proposed by Copeland and “deflated book capital” 
used by Stigler. Yot he does not elaborate his objections to these latter 
measures, nor does he suggest alternatives. He is apparently unacquainted 
with the more recent attempts to measure total productivity which have 
necessarily involved estimates of real capital services. 

Mehta is somewhat ambivalent as to the meaning of productivity com- 
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parisons, and he does not really come to grips with the problem of aggrega- 
tion. While he writes in one place of “intractabi« and insuperable difficulties” 
in construction of the indexes (p. 62), elsewhere he states that “the ratio of 
total output to aggregated input both valued at constant prices, undoubt- 
edly, furnishes a fairly reliable and satisfactory index for the measurement 
of temporal, spatial and cross-sectional changes in industriai productivity” 
(p. 88). The latter statement he does make dependent on the requirement 
“that some scientific and rational system of weightage be evolved . . . so 
that the differences in productivity arising from the variations in the produc- 
tion complexes could be easily ironed out” (p. 89). But since the index num- 
ber problem is indeed intractable, it would have been more fruitful if the 
author had discussed specific conventions in terms of their meaningfulness in 
measuring changes in productive efficiency. Instead, he merely warns that 
utmost caution must be used in interpreting the movements of productivity 
indexes. 

The author devotes a relatively fuli chapter to a descriptive analysis of 
the determinants of productivity changes. Mehta recognizes that, given 
the natural and human resource base of the economy, changes in the total 
productivity ratio are primarily a reflection of changes in technology, broadly 
interpreted. It is slightly confusing, therefore, to find him discussing the 
relationship of technology to productivity on a coordinate basis with factors 
such as size of firm, size of markets, age of plant, organizational factors, 
capital availability, and institutional factors. Although Mechta would like 
to see quantitative estimates made of “the influence of each individual 
determinant”, he recognizes that the causes of productivity change are so 
many, cemplex, and interrelated that primary reliance must be placed on 
deductive reasoning. In one case, Mehta has misivterpreted the evidence 
when he cites several studies as showing that “moderate reduction in hours 
of work has invariably been accompanied by an increase in output per 
worker” (p. 57). 

In his review of the uses of productivity indexes, Mehta’s enthusiasm for 
productivity analysis occasionally leads him overboard. Thus, it is question- 
able whether productivity indexes can aid significantly “in the selection of 
productive technique most appropriate in the prevailing economic and 
industrial conditions of the country” (p. 2). It would be more convincing if 
the author had presented evidence to support his assertion that “produc- 
tivity indices are now extensively used as objective measures for studying the 
distribution of the ‘fruits of production’ among all parties at interest” (p. 3). 
Mehta also claims that inter-industry productivity comparisons can be used 
to show whether resources are being transferred “from low to high produc- 
tivity industries” (p. 7). This reviewer agrees with Bratt, who is quoted by 
Mehta, that total productivity cannot be compared as among different 
industries in the same time period (if the factor prices prevailing in each 
industry are used to weight its inputs). 

Yet few will question the very real useful.» ss of productivity indexes, and 
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partial input-output ratios, for many of the important purposes which 
Mehta lists. Certainly the subject of productivity, as a central strand in the 
study of economic growth, deserves more and better work from both the 
theoretical and statistical standpoints than it has yet received. 


An Econometric Analysis of Construction. John M. Mattila. Madison, Wisconsin: 
University of Wisconsin Bureau of Business Research and Service, Wisconsin 
Commerce Reports, Vol. IV, No. 1, April, 1955. Pp. 85. $1.15. Paper. 


F. E. Batperston, Carnegie Institute of Technology 


I THIS econometric study, undertaken avowedly in the Cowles Commission 
spirit, Mattila develops and tests a highly aggregated linear model of the 
United States construction industry. The industry is divided into three 
sectors: private residential construction (considered endogenous to the 
model), private non-residential construction (also endogenous), and public 
construction (considered exogenous). The data used for estimation of 
parameters cover the period 1919-50, excluding the war years 1942-45, 
inclusive. After a single supply equation and two demand equations are de- 
fined for the construction industry, closure of the model is effected by means 
of a pair of equations: a disposable income-multiplier equation, and a non- 
construction net private investment equation. 

The supply equation says that the total amount of construction (in con- 
stant dollars) is a linear function of the ratio of materials prices to construc- 
tion costs, of the ratio of construction wage rates to total construction costs, 
and of a random disturbance term. The author was aware that somewhat 
different supply conditions might characterize the two endogenous sectors of 
the construction industry, but a single equation was chosen in order to 
simplify the model. Parameter values show that total output is an inverse 
function of costs and that output is approximately twice as sensitive to 
changes in materials prices as to changes in wage rates. 

On the demand side, it is shown that private residential construction is 
quite income-elastic (the elasticity coefficient is +3.28 at the mean value of 
United States disposable income) and also that it is quite sensitive to changes 
in the ratio of rents (chosen as an approximation to the capital cost of housing 
units) to construction costs. The latter statement cannot be interpreted to 
mean that the primary demand for shelter is price-sensitive (the model is 
addressed directly to the net demand for construction and does not approach 
this through an examination of the demand for shelter and then of the 
derived demand for new housing). The predetermined stock variable—the 
constant-dollar value of all residential structures, taken at the beginning of 
each period—has negative but small influence on the demand for new private 
residential construction, and net family formation has positive but small 
influence. As the author points out, the algebraic values of the elasticities for 
the several variables run according to our expectations in every case. 

The results are less convincing in the equation for private non-residential 
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construction demand. This variable is a function of business expectations and 
of the available stock of non-residential structures. The former was originally 
represented by the ratio of current corporate profits to construction costs 
and (in an additional term) by the same ratio, lagged one year. In this form, 
the author found substantial multicollinearity between the current profits- 
costs ratio and the stock-of-structures variable. The equation was therefore 
reduced to make business demand for construction a function of the lagged 
ratio and of the available stock. This formulation leaves a good to deal be 
desired. 

Estimates of parameters were made by the limited-information, maximum 
likelihood method, equation by equation. In every case, a standard error of 
estimate was computed for each coefficient (the coefficients were generally 
found to be significant with 95% confidence or better). In addition, the 
author computed a coefficient of correlation for each equation; in general 
these were in the range, 0.6-0.9. Doubts as to interpretation of all of these 
results is aroused, however, by the presence of substantial serial correlation 
in the random disturbance variables pertaining to all of the equations. From 
this fact may emerge a few unhappy inferences concerning the conduct of 
individual research in econometrics. 

The presence of serial correlation probably traces to the extreme degree 
of aggregation that was required to bring the amount of computation down 
to what could be done by one man operating a desk calculator. The small 
total number of endogenous and of lagged variables left substantial room for 
non-random disturbance effects. The implication is that this problem could 
be avoided only by expansion of the equation system—but this would push 
the computational task beyond the resources of one researcher. Before sta- 
tistical methods were developed for the more refined testing of significance 
and for the handling of large model-systems, it was no doubt very desirable 
to have numerous individual exploratory efforts, especially when the 
alternative was to have no progress at all. Perhaps the time has now come 
for the abandonment of strictly individual econometric effort in the formula- 
tion and testing of macro-economic models. 

The above remarks seem particularly pertinent because Mattila has done 
a careful and praiseworthy job with his problem, has no doubt pushed his 
own personal resources to the utmost, and yet has produced results which, 
as he points out, contain a vexing measure of undependability. 

If the formulation of a larger model could be contemplated, it would be 
possible to draw more heavily upon the considerable state of general knowl- 
edge of the private construction sector: Sherman Maisel’s extensive work on 
the general demand function for private housing in his doctoral dissertation; 
work by the same author on the costs and efficiency of different sizes of 
residential construction firm; Richard Scheuch’s analysis of labor supply 
and labor efficiency in construction; the work of Grebler and his associates 
on the rental housing market; and Colean and Newcomb’s investigations of 
non-residential construction demands, costs, and output capacities. 
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The present study, however, contains both a set of statistical results and 
a sound discussion of the derivation of the equation-system and of its sta- 
tistical reliability. As such, it deserves the attention of those who are working 
in this field. 


Experience in Urban Real Estate Investment. Leo Grebler. New York: Columbia 
University Press, 1955. Pp. viii, 277, $9.00. 


A. B. Hanpuer, University of Michigan 


HERE durable geods and particularly buildings are concerned, our 

knowledge consists to a surprising «xtent of the rule-of-thumb, the crude 
approximation, and various tidbits of ancestral lore. It is, for example, 
practically a folk-myth that rent is aud should be 20 per cent of family 
income, that property maintenance is 1-2 per cent of “value”, that invest- 
ment in real estate is alinost riskless as well as highly profitable. “Normal” 
housing vacancy rates have become conventionalized at 5 per cent, and 
building depreciation rates at 2-3 per cent of original cost. Here the average 
plays the same role as the statement of absolute truth in prescientific modes 
of thinking. It is a summary of experience, unequivocally stated and readily 
grasped. It is not necessarily wrong—only unverifiable because stated in 
unconditional terms. 

That this state of affairs has persisted so long where real property is con- 
cerned is due partly to the conceptual apparatus utilized and partly to the 
difficulty in obtaining manageable information; and each of these has acted 
as a brake and thwart upon the other. With the possible exception of certain 
aspects of housing, penetrations through the barrier of informed opinion 
have been few and limited in scope. In this context Grebler’s exploration 
into the profitabllity of urban real estate as an investment is a venture of 
the first magnitude. Its importance lies not in the general applicability of the 
results obtained. For it cevers only 312 cases (comprising 581 properties) 
confined to New York City, and largely to Manhattan. It lies rather in the 
techniques developed for thrusting through this particular wilderness. 

The study seeks to answer the question: How profitable has investment 
in urban real estate been, and how has it compared with other forms of 
investment? To this end it examines the income, expenses and net returns of 
a sample comprising office buildings, lofts, taxpayers, and 4 types of resi- 
dential structures during the period 1900-1950. Only some of the data 
encompass the whole period; most extend over two to four decades. The 
study proceeds in a straightforward manner through analyses of: (1) the 
general course of income and expenses as well as comparative movements by 
type of structure and kind of expense; (2) changes in operating ratios of 
expenses and net income to gross earnings by type of structure; (3) changes 
in the return on capital; (4) comparison of net returns with investments in 
mortgages and in bonds. In addition a special attempt is made to measure 
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debt charges against net income; and brief excursions are made into the rela- 
tive effects of vacancy losses and rent changes on income, the profitability 
of slum properties, and changes in income-asset ratios. 

The methods used and the solutions to the numerous technical problems 
arising during the investigatic1 are meticulously and lucidly explained. In 
many respects the analysis of return on capital is technically the most inter- 
esting part of the study. Recognizing that each measure of net return has 
its own purposes and limitations, four are developed: ratio of net income to 
acquisition cost first on an annual basis and second as an average for the 
experience period; ratio of outstanding to original investment at assumed 
interest return, and allocation of remaining net income to amortization; 
realized yields, i.e., those which take into account the terminal value of the 
asset as well as annual net income. In not using depreciation rates in any of 
these calculations the author here as in the rest of the study consistently 
eschews the conventional and hews to the factual. 

The results depict the period to 1930 as a “golden age” of real estate in- 
vestment, during which time the existing folklore was developed and main- 
tained in the teeth of the sharply contrasting subsequent experience. The 
profitability of real estate investment is shown to have heen much more 
volatile and variable than investments in urban mortgages and corporate 
bonds. Net returns, too, have been “lower than one would expect considering 
the risks involved in real estate investment”. Even after World War II 
earnings failed to approximate previous high levels. However, evidence is 
cited in footnotes indicating that in other cities 1950 income exceeded 
previous high levels. 

Despite the tenuous nature of the results (which are carefully and properly 
qualified throughout), this study is a model of its kind. The author, I am 
sure, will feel amply rewarded if it stimulates further sorties into this area 


Trends in Consumer Finance. M. R. Neifeld. Easton, Pa.: Mack Publishing 
Company, 1954. Pp. xiv, 142. $6.00. 


Henry I. Kester, University of Washington 


rirends in Consumer Finance contains a wealth of statistical information 
describing the operating trends of the consumer finance companies in the 
United States for the years 1938-51. Neifeld has collected data from the 
annual reports of the state administrative agexzies where the Uniform Small 
Loan Laws, or their equivalent, are in effect. In addition, he has solicited 
help from the officials of these regulatory agencies and other trade sources 
in making special compilations and skillful estimates to fill in gaps in the 
data. Accordingly, this single volume organizes and presents much statistical 
information which heretofore has been unavailable for this industry. 
Simplicity of presentation is a strong virtue of this book. After a brief 
discussion of the historical development of the industry, the author presents 
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time series of annual data on both a national and state-by-state basis for 
year-end loans outstanding, number of licensees, average amount of loans 
outstanding, volume of loans made, principal collected, number of loans 
outstanding, bad debts, recoveries on charge-offs, charges earned but not 
collected, charges collected, and ratios of net earnings to assets. Information 
on the average size of offices, statutory loan rates, distribution of expenses, 
security for loans, composition of assets, financial structure of ten large 
companies, break-even size, and characteristics of borrowers is also pre- 
sented for selected years. Many fine charts are used to show trends and other 
operating characteristics of the industry. A brief analysis of the data pre- 
sented in each chapter adds much to the understanding of these trends in 
the consumer finance business for most of the past two decades. 

Failure to add the word “companies” to the title of this otherwise fine book 
is open to criticism on two counts. First, unnecessary confusion may have 
been created among those who think the term “consumer finance” synony- 
mous with the much broader term “consumer credit”. And, second, the author 
did not seize a real opportunity to help establish the title of “consumer 
finance companies” as the more appropriate one in reference to the sniall loan 
industry. In recent years “consumer finance companies” has been used 
widely in both the literature and discussion of this business. Moreover, this 
title is considered by many to be more appropriate to describe both the 
functions and operations of an industry providing a small cash instalment 
lending service to consumer borrowers. 


Despite these minor criticisms, Neifeld has made a real contribution to the 
literature of the field. His systematic organization of statistical information 
will facilitate research for those interested not only in the major operating 
trends in the consumer finance industry, but also, many other specific 
research projects in this field. 


Business Concentration and Price Policy. A Conference of the Universities-Na- 
tional Bureau Committee for Economic Research. Princeton: Princeton University 
Press, 1955. Pp. x, 514. $9.00. 


Warp 8. Bowman, Jr., University of Chicago 


HIs volume, containing twelve separate papers relating to measures of 

concentration, to price policy and other evidences of business behavior, 
and sometimes to the relationship between them, makes available the papers 
and the comments of a conference held in Princeton, New Jersey in June 
1952. ; 
Much of the volume is devoted to critical review of work previcusly under- 
taken in each of the subjects under review. New information or analysis is 
also included or important subsidiary issues such as tax effects on concen- 
tration, vertical and conglomerate integration, and mergers. The central 
problem posed by the conference, the relationship of the structure of in- 
dustry to its behavior, raised difficult preliminary issues involving a de- 
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termination of what a concentrated structure is, what it should be, and how 
it can be measured. Satisfactory norms of business behavior. from which 
departures occasioned by the unique effects of concentration may be de- 
termined, presented an equally difficult question. Much of the volume is 
devoted to these issues. - 

The first four of the twelve papers are focused on the question of what 
concentration is, where useful empirical data is to be found and how it may 
be evaluated. In the first paper, “Census Principles of Industry and Product 
Classification, Manufacturing Industries”, Conklin and Goldstein indicate 
how similar products and production processes of plants are combined into 
an industry. Particular attention is on supply conditions. In economic terms 
the rules are intrinsically ambiguous. For purposes of studying economic 
concentration in any meaningful sense in a monopoly-competition context 
(and for other purposes) feasible methods of detecting close substitution in 
both supply and demand need to be applied in determining what is an in- 
dustry. Stigler emphasizes this point in his introduction to this volume. 

“Measures of Concentration,” by Gideon Rosenbluth, contains a review 
of the main classes of indexes which have been used to measure concentration 
in industrial structures. He then evaluates the factors which should go into 
the choosing of an index and provides an exaruple of the use of cross-section 
data by comparing the general level of concentration in manufacturing in- 
dustries in Great Britain and the United States. The problem of the scope 
of an index (apart from its form) is then discussed. He concludes that we 
do not yet know enough about the behavior of the various concentration 
indexes to be able to choose between them in investigating any given prob- 
lem. Carefully formulated hypotheses regarding the joint effects of a number 
of variables is a prerequisite stressed by Rosenbluth. 

The monograph by Tibor Scitovsky, “Economic Theory and the Measure- 
ment of Concentration”, details the effects that an economic theorist might 
expect from concentration, and puts these to use first in criticizing existing 
measures and then in attempting to formulate better measurement as a 
means of testing oligopoly power. Scitovsky stresses the essential difference 
between measures of monopoly, usually conceived of as an effect of this con- 
dition, and measures of concentration, which only tell of its likelihood. He 
concludes that economic theory cannot offer much help, except in a negative 
way, in choosing among alternative measures of concentration. “Oligopoly 
theory”, he says “does not tell us the maximum number of firms among 
which competitive behavior will still be oligopolistic”. 

In “Measures of Monopoly Power and Concentration: Their Economic 
Significance”, John Perry Miller adopts an approach which stresses the 
sterility of “statics” and the need for “dynamics” in the Schumpeterian 
manner. Measures need to be designed to take account of non-price factors. 
Emphasis would be placed on product differentiation, flow of new investment 
and rates of technological change. 

The foregoing studies, relating to the general subject of measurement, 
emphasize the paucity of accepted findings and the plenty of debated issues. 
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Prospective entrants into this field of study may be discouraged. Perhaps 
they will be challenged. If they are this material should prove useful in 
avoiding pitfalls. 

Three of the papers included in this volume deal with important forces 
affecting concentration: “Survey of the Evidence and Findings on Mergers”, 
by Jesse W. Markham; “Survey of the Empirical Evidence on Economics of 
Seale”, by Caleb A. Smith; and “Effects of Taxes on Concentration”, by 
John Lintner and J. Keith Butters. 

Markham, after pointing up serious shortcomings of merger research to 
date (its non-theoretical nature, definitional problems and biases, statistical 
ambiguities in studies of the impact of mergers on concentration, and the 
absence of emphasis on firms not merged), then summarizes cyclical aspects, 
finding that mergers are only weakly associated with economic fluctuation. 
Correlation between mergers and wholesale prices does not confirm the 
hypothesis that mergers are motivated by declining prices. There appears to 
be a closer relationship between mergers and stock prices than with other 
indexes, but the relationship is precarious and no causal connection is sug- 
gested. Markham proceeds to analyze each major merger wave separately. 
He concludes that relatively few mergers appear to have had monopolization 
as their goal, that the most important single motive for mergers at the peak 
of merger movements seems to have been promotional profits, that many 
mergers are prompted by ordinary business transactions involving entry and 
exit, and finally, that some mergers come about as adjustments to major 
ninovations. The available historical series of mergers given by Markham 
are described by Stigler as—“sketchy in the extreme”. He points te the 
desirability of undertaking the vast task of recompiling the historical record. 

Caleb Smith’s study of economies of scale shows clearly how difficult the 
problems of measurement are in this area. The dangers of generalization are 
stressed. The principal difficulty is that the observed phenomena do not 
conform to the theoretical constructs. According to Smith, there is no homo- 
geneous product for which cost comparisons can accurately be made for firms 
of different sizes. Milton Friedman in commenting on the paper points up 
an even morc basic difficulty—* . . . theory itself gives no reason to expect 
that cross-section data will yield the relevant cost curves”. 

The study by Lintner and Butters on the effects of taxation on concentra- 
tion distills the results of their work in this field. Lintner and Butters find 
that the high rates of the corporate income tax in recent years have tended 
to increase the levels of concentration among business firms. The effects are 
of moderate proportion. This conclusion emerges from a careful appraisal of 
the tax effects on five principal factors: formation and early growth of new 
firms, management incentive to growth, relative rates of growth through 
retained earnings, availability of outside funds for the expansion of large 
and small firms, and the effects of taxation on mergers. A reasonable 
hypothesis is tested and an unambiguous conclusion is reached. Although in 
only one of their categories have they demonstrated this quantitatively, 
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Daniel Holland, their commentator, believes that the qualitative analysis 
does not call for a conclusion of “moderate effects”. This, he holds, leads to 
the mistaken notion that tax effects are powerful in theory and weak in 
practice. 

Three additional studies in the volume focus attention on various aspects 
of pricing behavior. “The Nature of Price Flexibility and the Determinants 
of Relative Price Changes in the Economy”, by Richard Ruggles, relegates 
concentration to a subordinate role (confirming Alfred Neal) and emphasizes 
the major importance of direct cost changes in explaining price changes. 
Another paper, “Full Costs, Cost Changes, and Prices”, by Richard B. 
Heflebower, reviews the development of the analysis which de-emphasizes 
or rejects marginal cost analysis seeking to replace it with a special theory 
peculiarly adaptable to markets characterized as being neither mons;~wlistic 
nor competitive. Major accomplishment here in terms of a convincing case, 
either theoretically or empirically, is yet to come. A third study on pricing 
practice, “Characteristics and Types of Price Discrimination”, by Fritz 
Machlup, elaborately classifies the subject matter with picturesque detail. 

A central public policy issue remaining to be resolved is suggested by the 
Machlup classification. This involves finding whether a particular use of 
price discrimination is “play the game” (a means of maximizing existing 
monopoly power) or “kill the rival” (a means of creating new monopoly 
power). There is neither theory nor evidence which allows the reader to judge 
the relative importance of these types. This reviewer is unaware of any care- 
ful study showing how “kill the rival” would create monopoly, or in fact that 
it has. Citation of evidence has been almost solely restricted to the old 
Standard Oil case. Is it a myth? Again, more study is needed. 

Two additional papers, one on vertical integration by M. A. Adelman and 
another on conglomerate bigness by Corwin Edwards round out the report 
of the Princeton conference. Each of these papers analyzes a problem the 
relationship of which to monopoly or to competition is both more remote and 
more controversial than is horizontal integration. Adelman does not attempt 
to deal with this complicated correlation problem here. He is concerned with 
the preliminary task of finding acceptable and unambiguous measures of 
vertical integration. After noting many deficiencies, Adelman emphasizes the 
relative merits of two measures: the ratio of income originating in the cor- 
poration (or industry) and its sales, and alternatively, the retio of inventory 
to sales. 

Corwin Edwards’ study of conglomerate bigness deals with a different kind 
of problem. The term conglomerate becomes s device for examining problems 
of size and power apart from th: traditional focus upon monopoly and ef- 
ficiency, according to’ Edwards. What is examined, however, as Stigler points 
out, is often consistent with well-established theoretical propositions. There 
has been much criticism of efforts to “prove” power by evidence of absolute 
as contrasted to relative size. Edwards’ hypothesis is, that irrespective of 
market percentages, size does matter. There are an infinitely large number of 





188 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


possible combinations of substitute, complementary and apparently un- 
related conglomerations. The analytical difficulties presented here are even 
greater than those which have been stressed in detail with respect to hori- 
zontal concentration. 

The volume as a whole is not fairly evaluated in terms of the definitive 
answers it provides. A “work-shop” conference is a useful and necessary step 
in a long and complicated task. The effect of business concentration ou 
pricing policy and marketing relationships is still a distant goal. In a field in 
which theoretical concepts are widely disputed, and where relevant material 
is meager, no disparagement should be inferred from s judgment that it is 
too early to capitalize expected rewards. 


A Methodological Study of Migration and Labor Mobility in Michigan and Ohio 
in 1947. Donald J. Bogue. Oxford, Ohio: Scripps Foundation for Research in 
Population Problems, 1952. Pp. 100. $1.00. Paper. 


Nataurz Rogorr, Columbia University 


wo disparate bodies of research dealing with the mobility of persons in 

the labor force are by now clearly discernible: the one contributed by 
economists and iemographers analyzing “labor mobility”, the other by 
sociologists concerned with “social mobility”. On the whole, the two sets of 
investigations have proceeded remarkably free of mutual interpenetration. 
Yet each has come independently to have the identical relationship with the 
theory to which it is linked. By providing empirical measurements of move- 
ment among positions in the occupational structure, long-held assumptions 
about mobility have been shown to be incorrect or, at best, oversimplified. 
The present volume takes its place squarely in the mainstream of this trend. 

Making use of available data concerning the employment histories of 
persons covered by Social Security benefits, the study analyzes changes of 
industry, of employer, and of location of employment (i.e., migration) for 
a one per cent sample of the Ohio and Michigan (insured) labor force. Bogue 
deais with mobility over a shorter run than usual, for his data cover the four 
quarters of the year 1947 only. 

The findings are of three types. First, over-all mobility rates are given 
(e.g., “In Michigan and Ohio, about one-third of all workers either changed 
counties, changed industries, and/or changed employers during 1947”). 
There follows a series of chapters on mobility differentials when workers are 
distinguished according to sex, race, age, industrial affiliation, consistency 
of (covered) employment during the year, and annual earnings. The relative 
effect of these factors (with the exception of race) is then examined in a 
multiple-variable analyisis. 

The ingenious use of readily accessible data is one of the more striking 
aspects of this study. An introductory chapter is devoted to an almost com- 
prehensive discussion of the Social Security records from which Bogue’s 
sample was drawn. However, one variable is totally ignored in the entire 
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volume—occupation. Insured workers’ occupations may not, of course, be 
recorded in the Social Security files; if this accounts for the omission, a word 
of explanation might have been offered in the light of the central role fre- 
quently given to occupation in the study of mobility differentials. 


Research on Labor Mobility: An Appraisal of Research Findings in the United 
States. Herbert S. Parnes. New York: Social Science Research Council, 1954. 
Pp. xi, 205. $1.75. 


Maraaret 8. Gorpon, University of California 


NYONE who has ever attempted, for his own purposes, to summarize the 
findings of the numerous labor mobility studies of the past few decades 
will appreciate the value of this volume. The difficulty, as Parnes points 
out, is that research on mobility problems has been “extremely diverse with 
respect to specific hypotheses, research design, and methodology.” Thus it 
is not easy to arrive at a judgment as to whether apparent differences in the 
findings are attributable to differences in the time period or locale of the 
studies or to variations in technique. 

Parnes threads his way through these difficulties with considerable skill. 
His study, which is one of several sponsored by the Labor Market Research 
Committee of the Social Science Research Council in recent years, will un- 
doubtedly be used extensively in courses dealing with labor market problems 
and will be “required reading” for research workers planning mobility studies. 
Statisticians will be particularly interested in his chapter on methodology, 
in which he discusses concepts and definitions, the relative merits of alterna- 
tive sources of mobility data, and the findings of studies which have under- 
taken to test the reliability of work history information obtained from 
interviews. 

Although Parnes’ coverage is extensive, the scope of his study is neverthe- 
less limited in several important respects. As the author himself points out 
it deals with labor mobility only in relation to the flexibility of the labor 
supply, largely ignoring its relationship to problems of personal adjustment 
or to the class structure of society. Thus the chief emphasis is on studies by 
economists, while relatively little attention is paid to the literature emanating 
from other disciplines, particularly the extensive sociological literature on 
mobility. Although these other studies have quite different frames of refer- 
ence, they overlap with the economists’ studies, particularly in relation to 
inter-occupational movement and the key problem of motivation. Parnes’ 
study would have helped to meet the need for more effective interdisciplinary 
communication in relation to mobility research had he broadened his cover- 
age in this respect, although his project might well have become unmanage- 
able. 

The limitation to American studies is likewise regrettable, though 
thoroughly understandable. The need for more attention to European 
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mobility studies is suggested by the fact that doubt has recently been cast 
on the widely accepted notion that there is less social mobility in Europe 
than in the United States.! 

In spite of these limitations, Parnes’ study fills an important need. He 
succeeds in drawing useful distinctions between those findings which may be 
regarded as firmly established conclusions and those which must be regarded 
as tentative hypotheses in need of additional testing. His final suggestion, to 
the effect that future progress in mobility research will “depend on the careful 
adaptation of research designs and methods of analysis to specificproblems” 
may prove to be especially important if, as seems likely, economic develop- 
ments of the next decade or so require a change in emphasis in this field of 
research. It may well be that the emphasis on problems of manpower scarcity, 
which has dominated much of the mobility research «f the last fifteen years, 
will have to give way to a greater concern with the problem of achieving full 
utilization of labor resources in the face of rapid tecknological change. 

1 Cf. S. M. Lipset and R. Bendix, Ideological Equilitarianism ani: Social Mobility in the United 


States, Reprint No. 76, (Berkeley: University of California, Institute of Industrial Relations, 1955), 
p. 34. 





The Labor Force in War and Transition, Four Countries. Occasional Paper 36. 
Clarence D. Long, New York: National Bureau of Economic Research, Inc., 
1952. Pp. 59, $1.00. Paper. 


A. J. Jaren, Columbia University 


ABoR Force experiences of four belligerents of World War II—United 
States, Great Britain, Canada and Germany are compared. The emphasis 
is on the extent to which the size of the employec population, by sex, ad- 
justed itself to the needs of this wer. In estimating the increas in the numbers 
of emploved, the author took into consideration b<*! the amount of popula-: 
tion gro «th which occurred during the war per’: 4... | ve amount of unem- 
ployment prior to the war. Analysis of change: ler. n of the work week 
is also included. 

His conclusions, in brief, are to the effect that the Unived States made the 
best showing. After allowing for population growth, he found that the 
proportion of the American population in the labor force at its war-time 
peak was 8.2 percentage points greater than just before the war—62.3% as 
compared with 54.1%. In Canada, the proportion rose by 6.8 percentage 
points, and in Great Britain by 4.7 percentage points in Germany, however, 
there was very little addition to the work force, even if foreigners and slave 
labor are included. 

The U.S. analysis, by and large, is not new. The facts about the other 
countries, however, are less well known in the U.S., so that the comparative 
analysis has intrinsic merit, and the paper is highly stimulating. The basic 
problem in such comparisons, however, revolves about the question of just 
how comparable the basic statistics are. Essentially, the analysis was de- 
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signed around the labor force statistics which the U.S. has; if an Englishman, 
for example, had designed the analysis around the British data which would 
be available to him, he probably would have designed the study somewhat 
differently. As a result of the non-comparability of the basic data, the author 
was forced to adjust them somehow 80 as to achieve some measure of 
comparability. 

However, it is well known that differences in the collection procedures can 
lead to quite different statistics. Hence, it is difficult to know exactly how 
to interpret small differences which may appear in the final analysis. For 
example, the difference between the U.S. and Canadian increases was 1.4 
points. Was there really a greater increase in the U.S., or can this difference 
be attributed to differences in the way in which the basic data were collected? 
Of the four countries, the data for Germany seem to have been of the 
poorest quality; is this, perhaps, one of the reasons why the size of the Ger- 
man work force did not seem to increase? 

The author shows that much of the increase in the working forces of the 
three Allied countries was due to the influx of women. Hence, he investigates 
this point in some detail. One interesting idea presented in this connection 
is that the number of women available for participation in the working force 
is a function of the number of people living in private households containing 
two or more persons. In the U.S. in 1940, the ratio of houseworkers to popu- 
lation in households was .228; in 1950 the ratio was almost identical. During 
the years of World War IT, many women became available for work simply as 
a result of there being fewer families because the men entered the Armed 
Forces. 

This is a ratio which should be examined in much greater detail than has 
been done; in particular it should be related to the number of young children, 
one of the most important factors in determining whether or not women are 
available for participation in the U.S. labor force. Before very much further 
use can be made of the ratio of .228 we should know whether or not it is a 
constant, and if not, under what conditions it changes, and in what manner. 

In trying to account for differences among the countries in the increase in 
their working forces, the author calculated the allotment and allowance 
benefits which were paid to the families of lower rank enlisted personnel, 
in each of the countries. These allotments were expressed as a ratio of the 
disposable monthly income per equivalent adult male. These ratios show 
that U.S. and Great Britain, which had the largest increases of women ia 
the working force, paid the smallest relative allotments. Canada and Ger- 
many, which had the smallest increases of women in the working force, paid 
relatively much higher benefits. Therefore, the author infers that in the U.S. 
and Britain, women “were obliged to work in order to live”, whereas in 
Canada and Germany, “a wife could get much of her ration withor.i work- 
ing” (p. 43). Without this hunger incentive, then, women tended not to 
enter the working force. 

The figures and ratios presented may be correct. One point overlooked, 
however, is that in the U.S. at least, the additional money sent home by 





192 AMERICAN STATISTICAL ASSOCIATION JOURNAL, MARCH 1956 


men in the Armed Forces, above that officially allotted, was very great. 
American soldiers received far higher pay than the soldiers of any other 
country, and a very considerable portion of this was sent to their families. 
If such income had been added to the official allotments there is considerable 
question whether U.S. women “were obliged to work in order to live”. Why 
then, did they enter the labor force? 


A Policy for Skilled Manpower: A Statement by the Council, with Facts and Is- 
sues Prepared by the Research Staff, National Manpower Council. New York: 
Columbia University Press, 1984. Pp. xxi, 299, Biblicgraphy. $4.50. 


Ciarence D. Lone, Johns Hopkins University 


ey interesting volume seeks to develop policies for increasing the supply 
of skilled personnel to the industries and armed forces of the United 
States. The demand for skilled labor, far from shrinking as a result of the 
revolution in technology over the last cent:iry, has probably been increasing 
and may well continue to expand in the future; for while advancing tech- 
nology dissolves many complex operations into simple and routine steps 
calling for ne specialized competence, it also sets up the need for new kinds 
of skilled craftsmen—to plan and make the models cf new machinery, and 
to produce, service, control, and maintain the new equipment. The authors 
look for an increasing use of skilled and technical workers in the development 
and pre-production stages of industrial operations, in assisting in medical, 
educational, research, and other professional areas, and in improving the 
quality of work done in the services fields. 

What can be done to meet this growing demand? This. question can best 
be answered by first examining the characteristics of the present supply of 
skilled and technical labor, and the channels through which the skills and 
techniques are acquired. 

The typical skilled worker—identified as an individual of distinctive 
abilities, particular competence, and specia! training—is slightly older than 
the average labor force member, has an education which seldom consists of 
less than 8 years or more than 12 years of schooling, has 2 or more years of 
specialized training, and is a white male—few skilled negro workers are 
found except in the hard, dirty, skilled jobs, many of them with unstable 
employment. Skilled workers acquire their techniques in four ways: through 
apprenticeship; through informal training in the shop; through vocational 
training in high school, night school, or correspondence school; and through 
training in the armed forces. 

Only a small percentage get all their training as apprentices. The bulk of 
skilled workers obtain their training informally, through work experience, 
promotion, and incidental training—a process of recruitment that is neces- 
sarily hit-or-miss and that is impeded by the various kinds of discrimination 
practice 1 against women and minority groups. Formal vocational training 
as part of a high school education has been especially important in the 
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United States. The proportion of youths 14-17 in high school has grown from 
3.8 per cent in 1890 to 65.3 per cent in 1952—of these, about a million youths 
in general high schools take federally-supported courses in trade, industry, 
home economics, and agriculture, and over half take at least one commercial 
course. Such training has the advantage of being more subject to control 
than the informal training methods. Moreover, a general high school 
education provides the foundation for a better vocational education. Thus, 
what the authors find that is right and wrong with high school training 
merits our attention. 

Their appraisal yields a mixed verdict. On the one hand, they give little 
support to the wildly heralded claims of deterioration in the quality of the 
high school curriculum and its graduates. On the other hand, they advance 
a number of strictures concerning the vocational training which high schools 
offer—a training which many employers and local union representatives also 
view without much enthusiasm. Any program for improving vocational high 
school education should recognize that the general qualifications of the 
worker-.-proper attitudes, general intelligence, good reading, writing, speak- 
ing, and arithmetic sbility—are more important than special aptitudes or 
skills. (At least this was the advice of the majority of the employers with 
whom the authors conferred.) If, however, a vocational program is to be 
stressed, the best arrangement is probably to have the student spend half his 
time in school and the other half at paid work under actual production con- 
ditions (though it is hard to interest employers in such a program). The 
authors recommend better teaching; a common program of reading, writing, 
speaking, and mathematics through the secoad year of high school; speciali- 
zation in vocational subjects only in the third or fourth years; opportunity 
for the better vocational students to go to college; more participation of 
Negroes and females; and special employer and labor advisory committees 
to keep the vocational program effective. 

No policy for skilled manpower should overlook the skilled labor programs 
of the armed forces, since three-fourths of our boys eventually enter the 
military service. A good proportion of these recruits receive an expensive 
training but inasmuch as the great majority stay in the service only a short 
time, the benefit to the armed forces is small in relation to the cost, and the 
training is usually too specialized to meet the needs of the worker in later 
civilian life. The armed forces are urged to reduce this waste by minimizing 
the turnover of personnel (through better pay and longer service require- 
ments), by training broadly for a future career instead of narrowly for a 
service specialty, and by making fuller use of civilian schools and civilian 
instructors in providing this broad training. 

This review has not said anything about statistics. The reason is that it is 
impossible to appraise the book from the standpoint of the primary interest 
of readers of a statistical journal; as the volume itself points out, it is a 
qualitative treatment of a policy question. Its excursion into statistics is 
confined to a few general figures such as those from the census on the skill 
composition of the labor force, and to a suggestion that the universities and 
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the government should encourage the accumulation of more information on 
our skilled manpower resources. The book could have been organized some- 
what better, and not very many of the ideas or suggestions struck me as 
being particularly novel. Nevertheless, I found it well worth while. It is very 
informative, and its judgments and proposals are thoroughly sound. A careful 
reading by the huge body of persons who control, or participate in, the train- 
ing of our skilled and technical workers would do much to bring about the 
adoption of a more effective policy for skilled manpower. 


Communism, Conformity, and Civil Liberties. Samuel A. Stouffer. Garden City, 
New York: Doubleday & Company, Inc., 1955. Pp. 278. $4.00. 


8. B. Wirney, University of Michigan 


Whe book is a very lucid presentation of a large amount of data obtained 
on two identical and simultaneous national surveys on the topical area 
of the book’s title. In this research tour de force, involving the National 
Opinion Research Center of the University of Chicago and the Gallup 
organization, almost 5,000 respondents were contacted, after being selected 
by probability sampling procedures. Also, 1,500 leaders (14 types, arbitrarily 
but precisely defined) were interviewed in towns in the 10,000 to 150,000 
population range. The field work was conducted during the Army-McCarthy 
hearings in May-July, 1954. 

The report is well documented with almost 150 charts, tables and figures, 
an unusually clear and lengthy account of procedures and their rationale, a 
full presentation of research inst.*''_..11ts, response rates and so forth and the 
work is quite well integrated with other work on the topic. 

The book attempts to do little more than present the state of affairs of 
public thinking. For one book this is a sufficient task—especially since much 
that is reported will be surprising to many and its clear presentation pro- 
hibits a brief development. 

The greater tolerance ef leaders is first developed on individual questions 
and then on a scalar measure. Concern, interest, and threat measures are 
then presented focused on the topical area. The tolerance measure is then 
related to age, education, a “rigidity” measure, a “conformity” item, and an 
“optimism of outlook” question; also to geographic location, population 
concentration, sex, church attendance, and known governmental actions 
about the communist threat. Other chapters deal with the aspects of com- 
munism that are distrusted, the perception of the kind of people who are 
communists and the amount of actusl contact with such people. The final 
chapter covers the author’s ideas on implications for action. 

Two bothersome results are discovered. Women are less tolerant than men 
and church goers are less tolerant than their fellows. No really good explana- 
tion of these relationships is appare1it. Some interesting and imaginative 
hypotheses are developed that would explain a large amount of the data, 
such as: tolerance is a function of amount of contact with people with dif- 
ferent ideas than one’s own; but this idea can be only tested assumptively— 
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e.g., men, metropolitan dwellers and the better educated have more contacts 
of this sort, therefore, they should be more tolerant. They are. 

It is a pity that with this wealth of data (the results from almost all ques- 
tions are reported) the analysis was not carried to a deeper or more complex 
level. It is hoped that such analyses will be forthcoming as time permits them 
to be made. 

It is also a weakness of the study that the interesting, independent psycho- 
logical variables such as “rigidity”, “conformity (in child rearing)” and 
“optimism of outlook” were the only ones included in the questionnaire and 
each one, as a measure, is based on only one question. The questionnaire is 
already a long one, however, and pressures against further lengthening must 
have been strong. 

Stouffer is to be praised for his thorough and clear presentation of his 
findings and the committee backing his efforts is to be commended for the 
collection and presentation of these revealing facts about the American 
public on this much debated and much misunderstood issue. Only the curious 
researcher is likely to wish that more questions had been asked or answered 
or feel that the analysis should have gone into the more complex areas of 
explanation. 


Social Area Analysis. Eshref Shevky and Wendell Bell. Stanford Sociological Se- 
ries, Number One. Stanford: Stanford University Press, 1955. Pp. vi, 70. $1.75. 


Rosert W. Buecu.ey, California Department of Public Health 


E authors state the major purposes of their contribution to be (1) “to 
restate the basic orientation and to specify the steps in the construct 
formation and index construction of this method of urban analysis” and (2) 
“to demonstrate the use of the typology as an analytic framework for the 
comparative study of the social structure of American cities”. 

The typology and index construction were first developed and described 
by Shevky and Williams in Social Areas of Los Angeles, which should be read 
by anyone interested in urban analysis. The present contribution restates the 
construct formation in very brief form. Three factors, Social Rank, Urbani- 
zation and Segregation, are extracted from a consideration of the changes 
in society brought about by the shift from a rural-agricultural to an urban- 
industrial basis. Indexes, designed to measure these factors, are selected and 
computed from the statistics given for census tracts in the U. 8. Census 
bulletins. The index of Social Rank, based in the Los Angeles study on occu- 
pation, schooling and rent, has been revised to omit rent, which cannot be 
computed from 1950 data for tracts of owner-occupied homes, and which, 
as a reflection of economic level, is affected seriously by rent controls. 
Urbanization continues to be based on fertility ratio, per cent of women who 
are at work, and the per cent of single family dwelling units. Segregation is 
based on the per cent of the population belonging to relatively isolated racial 
and national groups. Specific procedures are given for the computation of 
index scores for each of the factors. These scores are standardized to a divi- 
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sion into 100 parts of the range of scores found in Los Angeles, 1940. A 
revised division of social space into quarters on Social Rank and Urbaniza- 
tion, giving sixteen divisions, is made. 

The revised computational procedures and divisions are used to compute 
and map the social areas of the San Francisco Bay region, 1950. The com- 
puted scores for 1940 and 1950 are given in detail, and the sixteen possible 
areas are delineated on handsome bright colored maps. There is some dis- 
cussion of the reiationship to the indexes of the variables not entering into 
their computation. Sex ratio and age percentages are singled out for study, 
as being other important characteristics of tract populations. This analysis 
of the San Francisco bay area is not exhaustive, “The purpose is to demon- 
strate the use of the typology as a framework for the comparative study of 
the internal differentiation of American cities, ... ”. 

Much more remains to be done to make this type of analysis generally 
accepted and used. On the level of theory, Robinson’s criticism, that the 
area is not the individual resident in the area, is still valid. In the analysis 
of Chicago mental illness, for instance, Faris and Dunham found that the 
atypical resident of an area was a much higher risk for some types of insanity 
than the typical resident. Other criticisms on theory remain to be made, 
including a query about the number of degrees of freedom properly used to 
test correlations between variables by tracts in a city. 

Another feature of the analysis that might be adversely criticized is the 
approach to the constructs by ratiocination, rather than by more exact 
statistical methods. Bell, however, states that he has shown that factor 
analysis results in factors identifiable with the constructs, and he further 
states that Tryon, working independently, has found equally easily identifia- 
ble factors for the San Francisco bay area. Others have found similar factors 
for other American cities, though not making the claim that they are to be 
found generally.’ Little doubt remains that any approach, using the materials 
of the Census, will return similar results. Whether the Census has exhausted 
the dimensions in which populations can vary is a further question, beyond 
the scope of a review. 

A criticism on a more specific level is on the revision of the division of 
social space to resemble a Cartesian grid. The original skewed grid warned 
the casual observer and continually reminded others that the indexes of 
Social Rank and Urbanization as they were plotted against each other were 
correlated. While it is seldom possible to find the effects of variables com- 
pletely separated in nature their measures should be orthogonal, not merely 
graphed as such. The extreme measures taken to correct timepieces for 
temperature change are a case in poiat. The authors recognize this problem, 
but have found that the relationships vary between cities and thus cannot be 
standardized. Their solution is to “free” the index of urbanization “from 
conformity to the shape of any particular distribution of urbanization and 





1H. F. Gosnell and M. J. Schmidt, “Factovial and Correlational Analysis of the 1934 Vote in 
Chicago,” Journal of the American Statistical Assoviation, 31 (1936), 507-18, 1936. 
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social rank ... ”. This amounts to ignoring the correlation, which is hardly 
the best solution. 

Another criticism is the standardization of Social Rank and Urbanization 
to Los Angeles, 1940. While some standard seems to be needed, this par- 
ticular standard gives peculiar results when applied to the San Francisco 
area, 1950. Of the sixteen cells, four are completely empty, one contains one- 
half a tract, one contains two tracts, and five whole tracts and four half 
tracts are outside the boundaries of the grid. Fahrenheit had the same trouble 
when he standardized his thermometer to the range of temperatures as he 
found them in East Prussia: Moscow and Madrid simply did not fit into his 
grid. 

The tedious methods for computing scores must also be revised if the sys- 
tem is to be applied to any very large number of cities. The index of Segre- 
gation requires that 17 items given in the Census bulletin be summed. This 
sum is then divided by the population to give the index. Urbanization is 
based on a standardization of the mean of three components, each of which 
is a quotient, and one of which, Fertility, requires summing six items. Social 
rank requires the standardization of the sum <i two components, each a quo- 
tient, one of which requires the sum of four edv’<; ‘tion classes, and the other 
the sum of three occupation groups. These components are found in sepa- 
rate tables of the Census bulletins, and a complete transcription and compu- 
tation procedure must be set up to grind out the indexes. “Laborious” is the 
only appropriate word to describe this: procedure. 

Lest the criticisms above be taken to mean a disapproval of the entire 
contribution, the results of an actual use of the typology should be cited. 
Using the scores, and areas, for the City and County of San Francisco (made 
available in preliminary form by Bel!) the reviewer and John E. Dunn, Jr., 
M.D. have analyzed cancer mortality for the years around 1950. We have 
found that the populations of the various social areas differ significantly in 
the kinds and amounts of cancer, and we are preparing to undertake further 
analyses, using Shevky and Bell’s typology. 

In summary, a plan of analyzing the American city into social areas is 
developed and justified, and an example of an analysis is given. The re- 
viewer considers that further work is necessary on the theoretical, statis- 
tical and computational aspects of such analyses before they can be con- 
sidered standard methods. The resu'ts of this particular analysis have, how- 
ever, been found very useful in studying cancer mortality for San Francisco, 


Sociological Studies in Scale Analysis. Matilda White Riley, John W. Riley, Jr., 
and Jackson Toby. New Brunswick, New Jersey: Rutgers University Press, 1954. 
Pp. 433. $6.00. 


Kurt Lane, Canadian Broadcasting Corporation 


HIS volume is devoted to documenting the usefulness cf the cumulative 
scale (Guttman model) for the analysis of “collective” data. ‘The authors’ 
basic assumption, with which no one can quarrel, is that items completely 
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distinct from the usual attitude data may be fitted into a scale model and 
their unidimensionality thus validated. Further, this grouping of items along 
a single dimension, whenever possible, may facilitate the test of hypotheses 
for which an analysis of items taken individually might not have been 
suitable. 

In terms of the contents of this volume, this means that the authors and 
a number of contributors, in a mutually compatible fashion, examine various 
aspects of scales constructed out of sociometric data. Part I of the book is a 
general introduction. Part II introduces the “object scaie”, which, stated 
briefly, treats all the sociometrie choices for a single person as attributes of 
that person. The number of choices—irrespective of the subjects making 


them—received by each “object” for a given category serve as the items for - 


the scale. In Part III, the application of the cumulative scale is extended to 
“dyadic” relationships, i.e. a subject-object pair. Such a dyadic scale is built 
either out of several items for which a given subject names (or fails to name) 
a particular other, or it may make use of a series of reciprocal “choices”. In 
either case, the hypothesis that the various categories of “choice” between 
any two persons form a scale pattern is testable, and the prospect is raised 
that fruitful hypotheses about group structure are inferable. 

Finally Part IV deals in some detail with procedures—for speeding up the 
construction and increasing the reproducibility of scales. Some papers, such 
as one contributed by Robert N. ford, describe procedures applicable to 
scale construction in general, whereas others prepared by the authors show 
how these can be carried over into sociometric analysis. Three of the papers 
indicate ways of improving scalability, either by the H-Technique of con- 
triving items or by image analysis using the Israel Alpha Technique. Lack- 
ing altogether, although the autho:s show awareness of the problem, is a 
discussion of criteria for scalability other than reproducibility. 

A good bit of the text consists of either reprints or elaborations of previous 
publications. Nonetheless, the combined experience of the Rutgers group 
(plus the few outside papers) makes a most useful manual on methods em- 
ployed in analyzing sociometric materials, illustrating in particular the var- 
ious possibilities for ordering diverse data inherent in scalogram techniques. 
But the authors would have the book be more than that. To them, it re- 
lates the story of how “scale analysis was bent to the task of translating 
vague concepts into precise variables” (p. 5). Most of the new materials are 
aimed at showing correspondence between a conceptual model and its mathe- 
matical description as a cumulative scale. 

The distinction between “mechanical” and “organic” models is one of the 
topies approached by way of scalagram analysis. Whenever a two-person 
relationship proves scclable even though the subjects and objects that are 
part of the dyad do not scale, this is taken as prima facie evidence that there 
is a mutual contingency in their actions which, according to an internally 
differentiated “organic” model, is not revealed in the similarity of the traits 
of subjects or objects. Nonetheless, the authors’ illustration of how such a 
scale is constructed does not prevent its utility in the analysis of group 
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structure from remaining obscure and programmatic. A paper contributed 
by Uriel Foa illustrates how the sealing of a series of attitude items toward 
specified objects, obtained from the subjects in place of a single “choice”, 
may allow us to test some interesting hypotheses about conflict, derived from 
the higher components of such scales. In the subsequent chapter, the authors 
test some ideas about “~applementary” relationships which are seen to de- 
pend on group structure. Applying a finding from Whyte’s Street Corner So- 
ciety—a person will be respected simply because he is the friend of a re- 
spected person—, they show how in an empirically derived dyadic scale 
“being wished for as an associate” is harder than “being popular.” It seems, 
however, that a simple test of association between popularity ratings and 
status of associates (made on p. 258) is a more straightforward appreach to 
the same question. 

Extension of the cumulative scale to many areas is in itself to be welcomed, 
even if only as a means for grouping an unwieldy mass of items. There is 
much suggestive material on this score. On the other hand, the “operationali- 
zation” of concepts describing complex social systems is not achieved by 
scale-analysis, even of dyadic data. No amount of mathematical manipu- 
lation of sociometric materials can substitute for first-hand observation 
undertaken in response to a clearly conceptualized problem. 


Social and Psychological Factors Affecting Fertility, Volume Four. P. K. Whelp- 
ton and Clyde V. Kiser, Editors. New York: Milbank Memorial Fund, 1954. Pp. 
285. $1.00. Paper. 


Grorez F. Marr, Smith College 


HIS volume, which brings to twenty-four the number of reports on the 

“Indianapolis Study” to be made available in collected form, continues 
the gradual unfolding of a mass of data gathered in 1941 and 1942. In- 
tended as a means of expanding knowledge of the correlates of human fer- 
tility and as an exploration of some aspects of the methodology of sociz! 
research, the Study was designed as a test of twenty-three hypotheses 
formulated in advance by the committee in charge. Each hypothesis stated 
an expected relationship between some social, psychological, or economic 
variable, on the one hand, and the effectiveness of family planning and the 
size of planned families, on the other. Volume I of the collected reports of 
the Study, published in 1946, described the preliminary survey of Indianap- 
olis by which the couples to be interviewed were located. Volume IT (1950) 
reported on the planning of the Study, the collection of the data, and cer- 
tain characteristics of the respondents, and began the presentation of find- 
ings with respect to individual hypotheses. Each hypothesis, or closely re- 
lated group of hypotheses, was treated as a separate project, and the Study 
data were combed to discover the nature and extent of the relationship be- 
tween the hypothesis variable and fertility. Volume III (1952) continued 
this series of investigations. 
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The fourth volume, here under review, extends the series further, ex- 
amining ego-centered interest in children, fear of pregnancy and childbirth, 
and economic tension. In addition, it branches out from the original list of 
hypotheses in two directions. First, there are reports on several hypotheses 
not included among the initial twenty-three. These concern the size of 
family in two successive generations, and inter- and intragenerational social 
mobility. Second, the -eries of reports on individual hypotheses is inter- 
rupted to present an analysis of the use, effectiveness, and acceptability of 
methods of contraception, and a study of the intercorrela‘ ions of selected 
hypothesis variables. The latter is especially welcome. Although it is com- 
paratively brief and serves primarily to reaffirm the already clear dominance 
of economic over social and psychological variables in the Indianapolis data, 
the article constitutes at least a beginning toward the process of bringing to- 
gether the many fragments into which the analysis is now divided. 

Like its predece:sor, this volume must be read in the light of the papers of 
Vo'ume II that describe the sample and the basic analytical categories of 
the Study. Its value will be greatest to those able to make their own judg- 
ment of the extent to which the various conclusions are supported. It is a 
leading characteristic of the Indianapolis data that many of the relation- 
ships among the variables are present only to a mild degree. This often places 
upon the reader the burden of deciding whether a particular result is sig- 
nificant. For example, in the article on inter-generational social mobility, 
the family size of mobile couples is compared with that of the non-mobile, by 
occupation. If the clerical group is omitted, a relationship is found that would 
be expected on a chance basis only five per cent of the time (p. 978). The 
fact that inclusion of the clericals alters the probability to twenty-five per 
cent is mentioned only in a footnote—but it raises a question as to whether 
the omission of that group is an important limitation on the significance of 
the result found. 

As has been the case throughout the Indianapolis Study, the seven papers 
gathered in this volume present a variety of problems interesting from a 
methodological viewpoint. Among those discussed at some length, and not 
yet satisfactorily resolved, is the measurement of the effectiveness of contra- 
ception. Although the authors use the so-called “effectiveness ratio” as one 
of their chief instruments of analysis, they are in considerable doubt as to its 
adequacy and devote an appendix to an exposition of some of the difficulties. 
In this connection, the argument of pages 950-51 might have been strength- 
ened by reference to the “ineffectiveness ratio” proposed in the main body 
of the text. Some of the apparent insensitivity of the effectiveness ratio to 
changes in the base upon which it is calculated would thus be overcome. 

At this stage in the publication program of the Indianapolis Study it may 
be appropriate to offer a comment or two on the Study as a whole, with an 
eye to its implications for future projects of this sort. It is by now doubtful 
that so extensive a range of topics as were included in the list of twenty-three 
hypotheses should be approached in a single study. Rather often in the anal- 
ysis it has developed that the number of cases available was not adequate 
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a comprehensive coverage of world agriculture with consideration of “Agri- 
culture in the World Economy”; “Land, Farms and Farming”; “Food 
Crops”; “Technical Crops”; “Live Stock and Animal Products”; “Forests 
and Forest Products”; and “Fisheries”. Part IV, “Energy and Mining” con- 
tains about 230 pages covering the chapter headings “Mining in the World 
Economy”; “Metallic and non-Metallic Minerals”; “Coal”; “Petroleum and 
Natural Gas”; and the “Economics of Energy and Power”. Part V, “Manu- 
factures”, contains an additional 220 pages covering “Manufactures in the 
World Economy”; “The Food, Drink and Tobacco Industries”; “The Tex- 
tile Industry”; “The Iron and Steel Industry”; “The Machinery and Trans- 
portation Equipment Industries”; and “The Chemical Industry”. The vol- 
ume also includes some 36 pages of bibliography classified by chapters, and 
author and s:bject indexes. 

In view of the tremendous range of subjects covered and the vast body of 
quantitative material included it would be unbecoming for this, or any re- 
viewer, to indicate selected areas in which to the reviewer’s personal knowl- 
edge the data or treatment might be improved. Yet it remains an obligation 
to make the following critical observations: 

First, the volume is uneven in its coverage, statistics and treatment. 

Second, the work lacks integration and despite its presentation as a book 
or treatise is perhaps more useful as an encylopedie reference volume. 

Third, each of the parts in this volume might well nave been a separate 
book and as such would have profited from ar additional investment of time 
and energy to produce a more unified and analytical text. 

Fourth, the textual material is primarily descriptive a™d offers little in 
the way of a support of, or contribution to, theory to the fields with which 
it is concerned. 

These criticisms add up perhaps to the general observation that if an inte- 
grated treatise on the subjects covered was the objective, the Woytinskys 
have really undertaken to do more than any two-person team could possibly 
achieve with the kind of resources and time which were at their disposal. 

The reviewer wishes that he could say that the delay in the writing of this 
review is attributable to the time required completely to read the volume. 
Most readers, like this reviewer, will never read all of it. But statisticians, 
demographers and economists will find this a highly useful reference work to 
have handy. It brings together in one readily accessible place a mine of ma- 
terials which otherwise can be reached only through extensive searches and 
often in foreign languages which are unknown to most American scholars. 

On balance, there is no question but that demographers, economists and 
statisticans are greatly indebted to the Woytinskys for this volume and for 
an example of what can be accomplished by two exceedingly competent 
people who really know how to work. 
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latory and other chronic diseases, the hospitalization of children and of 
aged persons. 

The limitations of these statistics are carefully pointed out. They are the 
result of a six-month experiment only and are highly selective in regard to 
the type of patients covered, primarily incigents in New York City. Hence 
the figures cannot be expected to be representative of hospital morbidity in 
general. The average stay, for instance, ia the general hospitals under study 
was as high as 16.6 days. The study also covers tuberculosis, cancer, psy- 
chiatric, chronic disease, and communicable disease institutions. 

Among the many interesting factors brought to light by the study is the 
wide variation in the experience even between hospitals of similar type, under 
common auspices and central control. The tables indicate that the use of 
home care is widespread in these hospitals (only one of the fourteen general 
hospitals reported no home care services), and the impact of age and sex on 
the demand for hospitalization are pointed out. 

The recommendations contained in the last chapter, although specifically 
directed to the administration of hospitals in New York City, have wider 
general interest. The authors discuss here further questions of methodology, 
euch as sampling versus complete enumeration, suggested classifications for 
age and length of stay, and particularly also cost estimates. 

In all, an interesting demonstration of what can be done with hospital 
morbidity statistics and a very valuable contribution to the methodology of 
similar projects. This study also provides an answer to the old question of 
whether we should have routine hospital morbidity statistics or not: if the 
purpose is to measure general morbidity in the community, the answer is 
doubtful, but for administrative purposes the answer is an unqualified yes, 


Método y aplicaciones del Muestreo Estadistico. Ana Maria Flores. Departa- 
mento de Muestreo, Mexico, Julio, 1954. Pp. 54. 


Srxtro Rios, Instituto de Investigaciones Estadisticas, Madrid 


HE Secretarfa de Economfa de la Direccién General de Estadfsticia de 

Mexico (Departamento de Muestreo) has now reproduced a set of lec- 
tures delivered by Ana Maria Flores, as a contribution to training of tech- 
nicians of Departamento de Muestreo and other Mexican agencies. 

It is not a book about sampling, but a collection of lectures whose main 
topics are: Sampling, Sampling Surveys in Mexico, The Importance of 
Statistics in Medical Research, Probability Calculus in morbidity research. 

The general style is descriptive and in some way propagandistic, but 
probably appropriate for the audience to whom the lectures were given. 

Although the lectures are non-mathematical, the omission of such notions 
as confidence intervals reduces the clarity with which the true significance 
of sampling is conveyed. 

There are some minor faults, for example, the assertion (p. 14) that 
Neyman is English and Fisher American, and the omission of Cochrans’s 
book from the bibliography. 
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PUBLICATIONS RECEIVED 


Abbott, Lawrence. Quality and Com- 
petition—An Essay in Economic The- 
ory. New te lumbia University 
Press, 1955. $3.7 

Senge Fisheries and Food, 
of. Domestic Food Consump- 
tion po ———— 1953, Annual 
Report of the National Food Survey 
Committee. London: Her Majesty’s 
Gapetanery Office, 1955. 4s. Paper. 

Bochner, Salomon. Harmonic Anal- 
jae and the Theory of Probability. Ber- 

eley: University of California Press, 
1955. $4.50. 

Sloe Executives’ Research 
Group in cooperation with the Whar- 
ton School of Finance and Commerce, 
Uaiversity of Pennsylvania. The Stra- 
tegic Role of ‘eget, Philadelphia: 
1955. $1.00. Pape 

Chicago, City ot Office of the Hous- 
ing and Redevelopment Coordinator. 
Residential Construction and Related 
Data—Chicago Metropolitan Area, 
1964. Chicago: 1954. No price. Paper. 

Cuzzort, eee P. Suburbaniza- 
tion of Service Industries Within Stand-’ 
ard Metropolitan Areas—Studies in 
Population Distribution #10. Oxford, 
Ohio: Scripps Foundation, Miami Uni- 
versity and Population Research and 
Training Center, case aed of Chi- 
cago, 1955. $1.50. hy 

Dayton, Scott. A Manual o, p se Prob- 
lems in Statistics— Revised 
New Tomk: Henry Holt and Co., 1955. 
$1.95. Paper. 

Diesslin, Howard G. Agricultural 
Equipment Financing, Occasional Pa~ 
Paper #650, Financial Research Pro- 
gram. New York: National Bureau of 
Economic Research, 1955. $1.25. Pa- 


per. 

Dornbush, Sanford N., and Schmid, 
Calvin F. A Primer of Social Statistics. 
New York: McGraw-Hill Book Co., 
Inc., 1955. $4.75. 

Edwards, Allen L. Statistical Meth- 
ods for the Behavioral Sciences. New 
York: Rinehart and Co., Inc., 1954. 


$6.50. 

Fisher, Waldo E. and James, 
Charles M. Minimum Price Fixing in 
the Bituminous Coal Industry. A Re- 
port of the National Bureau of Eco- 
nomic Research in cooperation with 
the Wharton School of Finance and 


Commerce, University of Pennsyl- 
=. ceton: Princeton Univer- 
Press, 1955. $10.00. 


Ro 
in Collective Bargaining. Princeton: 
Princeton bao og Industrial Re- 


io a a 1955. Tce eee lacie 


and’ Scher, Stanley. Brien and Price 
Differentials on Over-the-Courter Mar- 
kets. Philadelphia: Universit ioe Penn- 
"Geni 1955. $1.25. 
AP . of the Poses of iB “ 
r 0, 
rado 3 Economics Boulder: 
of Colorado 1955. 
$1.50. Paper. 


Herdan, G. Statistics of Thera 
Trials. a, Texas: Elsevier 
Inc., 1955. $10. 
it Sian oh Alle Renmsien, Life In- 
surance Fact Book—1955. New York: 
= Free. Pa 
tion for Re- 


Associa 

ph. . in Income and Wealth. Income 
and Wealth-—Series IV. London: 

— and any aM. an . 
wa, Evelyn and Bogue, 
Donald J. Suburbanization of Manu- 
facturing Activity Within Standard 
—— Areas—Studies in Popu- 
ian Benen #9. Oxford, Ohio: 
Scripps Foundation, Miami Univer- 
say Peguietton Research and Train- 
Center, — of Chicago, 


1955. $1.80. Paper. 
Leftwi Se Richard HL. The a oe 
location. 


tem son Resource A 
ay Rinehart and Co., Inc., 1955. 
50. 


H. F. British Incomes and 


Pio ny Institute of Statistics Mono- 


graph #5. Oxford: Basil Blackwell, 
1955. 32/6. 

Milbank Memorial Fund. Current 
Research in Human Fertility. Papers 
presented at the 1954 annual confer- 
ence of the Milbank Memorial Fund. 
New York: 1955. $1.00. Paper. 

Mills, Frederick C. Statistical Meth- 
ods, 8rd Edition. New York: Henry 
Holt and Co., 1955. $6.95. 

National Bureau of Economic Re- 
search, by the Conference on Research 
in Income and Wealth. Input-Output 
Analysis: An Appraisal—Studies in 
Income and Wealth, Vol. 18. Princeton: 
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THE STATE OF FOOD AND AGRICULTURE 1955 


Review of a Decade and Outlook 
Rome, September 1955 
xi + 236 pages, numerous graphs and tables. 
US $2.50 or 12s. 6d. 
; atin te pondecsssnse, Cie, yest’ staettcm Ga siete of Seed oGh eaneeteees Bess mee not deal mainly 
ith the current situation and short-term outlook, but the progress and experiences 
(ithe whole decade. The treatment of this theme is analytical rather sther than descriptive and an attempt 


bring out the under causes of and 
results achieved. main issues and yp a LP 


. 


Il (im the same volume) reviews the production, consumption and trade developments com 
’ commodity Tccheline Gecacen ond Tekeinc ss"lie Tn ee sence ch oe dakar 





YEARBOOK OF FISHERY STATISTICS 1952/53 


Part |\—Production Part Il—Trade 


A basic reference work on world trade in agri- 
cultural products statistics of the imports 
and exports of the commodi- 
ties. It includes 

gn and 
‘or some commoc 
trade season as well as by 





Price: Each part $3.50 or 17s. 6d. 





YEARBOOK OF FISHERY STATISTICS 1952/53 


Part |—Production and Craft Part Il—International Trade 
cn production and piven more detailed daia The statistical tables give the same data as in 


oe at, Se years, but have been fol- 
i Fic Rh By Bw By Clas- 


contains a 
from 1910 to 1953 teshen down by species 
countries. Price: Each part $3.00 or 15s. 





YEARBOOK OF FOREST PRODUCTS STATISTICS 1954 


Tile resstenty, a ciatehs bs 0s etn ee ee eee 
oupateuasd, aan rn Soko cee a el 
Price: $2.50 or 12s. 6d. 





MONTHLY BULLETIN OF AGRICULTURAL ECONOMICS AND STATISTICS 


ae unity Vite ©: nee ~~. to-date statistics on ali 
major aspects of the world’s food and agriculture erat ake trade and prices. 
Data for the most important commodities, ¢.g., wheat, three times a 
year, and data for other commodities once or twice a year according eke (eeamionan Mich inoue 
contains articles, commodity notes and statistical ta! 
Editions in English, French and Spanish. 


Annual Subscription: $5.00 or 25s. 
Single issues: $0.50 or 2s. 6d. 





CATALOGUE OF PUBLICATIONS 
FAO Documents All FAO publications are available 
in U.S.A. from FAO's sales agent: 


COLUMBIA UNIVERSITY PRESS 
International Documents Service 
2960 Browdway, NEW YORK 27, N.Y. 


Please mention the Journal of the American Statistica, Association in writing advertisers 
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OPERATIONS RESEARCH 


Stimulating assignmenis in 
your specialty, contact with fine engineering and 
scientific minds, an informa) academic atmosphere make this a porition 
particularly interesting. Y u'll work on unusual projects at Ithaca, New 
York with the General Electric Advanced Electronics Center at Cornell 
University. 
To qualify, a backgromnd in statistics, linear programming or game theory 
is necessary, plus the ability to conduct operations research on industrial 
problems and organizations and/or military systems. Formal education 
should include a Ph.D. in Applied Mathematics 
or Engineering. 


_— — 
= — 
=e e, _ 
"3 aoe 
—_— — — 
—_ ; eos a — 
—_— Please address inguirics to: ee ee, 


Dr. Paul Doigan 
Advanced Degree Personnel, GE Advanced Electronics Center 


GENERAL @® ELECTRIC 
at Cornell University, Ithaca, New York 














AN INTRODUCTION TO 


STOCHASTIC PROCESSES 


hy M. S. Bartlett 

A= survey of the new mathematical and statistical tech- 

niques for studying stochastic processes (any processes that 
have some statistical or chance element in their structure, such as 
the phenomenon of Brownian motion, the growth of a bacterial 
colony, or the fluctuating numbers of electrons and photons in a 
cosmic ray shower), 
As processes of this kind are of universal occurrence, the book will 
interest a wide range of specialists. The general mathematical 
methods deveioyed in the early chapters form the basis for any 
applications, physical or biological. 


$6.50, at your bookseller’s 
CAMBRIDGE UNIVERSITY PRESS 
32 East 57th Street, New York 22, N.Y. 











Please mention the Journe! of the Amenican Statistica, Association in writing advertisers 




















For 1956 
Frederick C. Mills’ 
AN INTRODUCTION TO STATISTICS 


Excellently adapted to the one-semester course, this text is an abbreviated, 
simplified version of Professor Mills’ widely used Statistical Methods. 
Fully up to date, it retains the clarity and scholarship of the larger volume. 
Practical applications are illustrated throughout by realistic examples. 





William J. Fellner’s 
TRENDS AND CYCLES 
IN ECONOMIC ACTIVITY 


A systematic treatment of business cycle problems. Working the analysis 
of cycles into the theme of economic growth, the book covers the general 
area of dynamic economics. 


Important Holt texts in statistics 
STATISTICAL METHODS, Third Ed. 


STATISTICAL INFERENCE 


FIRST COURSE IN 
PROBABILITY AND STATISTICS 


STATISTICS FOR 
SOCIOLOGISTS, Rev. 


ELEMENTARY STATISTICS, Rev. 


MATHEMATICS ESSENTIAL FOR 
ELEMENTARY STATISTICS 





Henry Holt and Co. 
New York 17 ¢ San Francisco 5 








Please mention the Journal of the Ammnican S 
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GRAPHIC FACTS dramatically and NOW—with Chart-Pak-— 
everything you need is printed on rolls 
al oat ne 

ou can 
are dramatic facts be taken ap as quickly 

average charting time 
far less than other methods. And 
secretary can do it. 








And you can make graphs 
yourself quickly, easily 
the cost of the 


save 
Kit with your first chart 
. » witheut ever touching @ pen! 

















— Arnold W. Johnson's— 


PRINCIPLES OF 


AUDITING 


Including a chapter by John Neter on the 
use of statistical methods in auditing 


A thorough, modern treatment emphasizing internal control. Cites 
the latest authoritative thought on such matters as net income, in- 
ventory, current assets, etc.; contains representative working papers, 
audit program. ($6.50) Johnson’s Case Problems in Auditing 
($4.00) available for use with this text at the special combined 
price of $8.50. 


» Coming soon: Tre new revised edition of Johnson’s ELEMENTARY 
ACCOUNTING 





Write for examination copies 


232 Madison Ave. RINEHART New Yor’. (6, 1.¥. 


Please mention the Journal of the Amunican Statisticat Association in writing advertisers 
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now has available 
One Long-Range Career Opening in 
OPERATIONS ANALYSIS 


STATISTICIAN-ENGINEER 
Dutieg include; statistical ssn “atarne analysis of 


Inventory ‘=e See fa ae tf 
tion of production, ons we arene cael aiinen eae be 
od acai aoe Spr ae st apo 
esired qualifications: mathematical statistics 
ion science (ingiading wn tg with x + with and 
fa Se ees 
a eae on anmt p= or probability 


submit @ detailed résumé, 
ne Se feet. UPAR: 


Mr. J. C. Costelle, Jr.. Engineering Department 


E. |. du Pont de Nemours & Co., Ine 
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THE JOURNAL OF INDUSTRIAL ECONOMICS 
Three issues yearly Subscription post free 
‘An Anglo-American alliance of economists devoted to this subject is liable to 
be stimulating’——The Manchester Guardian. 
Contents of last number 
I. P. Andren Monopoly Investigation and Methods for Calculating the Rate 
of Return on Capital Employed. 
J. L. Meij Some Fundamental Principles of a General Theory of Man- 
agement. 
Clarence Jung Investment Decisions and the Non-Linear Cycle. 
Michael Beesley The Birth and Death of Industrial Establishments: Experi- 
ence in the West Midlands Conurbation. 
wv’ David Gordon Tyndall Price Policy in a Nationally Administered Industry—an 
Alternative View. 
J. A. P. Treasure Direct and Indirect Exports. 
CO. Roberts Labour Productivity and Utilization in Capitalist and Co- 
operative Shop Trading. 
BASIL BLACKWELL 
49 Broad Street Oxford England 
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